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MAMMALIAN PROTEIN WITH PUTATIVE FUNCTION IN SIGNAL 

TRANSDUCTION 

FIELD OF THE INVENTION 

5 The present invention relates to compounds and methods for the identification, prevention or treatment of 

disease, preferably cardiac disease, in a mammal through the administration of polypeptides capable of regulating 
signal transduction, which preferably exhibit kinase activity, or antibodies against such polypeptides that inhibit the 
interaction of these polypeptides with other mediators of signal transduction. In addition, the compounds and methods 
of the present invention can facilitate the identification or isolation of additional mediators of signal transduction 

10 associated with disease, preferably cardiac disease, which in turn may also be used in the identification, prevention or 
treatment of disease, preferably cardiac disease, in mammals. 

BACKGROUND OF THE INVENTION 
Certain biological functions, such as growth and differentiation, are tightly regulated by signal transduction 
1 5 pathways within cells. Signal transduction pathways maintain the balanced steady state functioning of a cell. Disease 
states can arise when signal transduction in a cell breaks down, thereby removing the control that typically exists over 
cellular functions. Because signal transduction networks regulate a multitude of cellular functions depending upon the 
cell type, a wide variety of diseases can result from abnormalities in such networks. Devastating diseases such as 
cancer, autoimmune diseases, allergic reactions, inflammation, neurological disorders and hormone-related diseases can 
20 result from abnormal signal transduction. For example, tumors may develop when regulation of cell growth is 
disrupted. 

Despite a long-felt need to understand and discover methods for regulating cells involved in various disease 
states, the complexity of signal transduction pathways has precluded the development of products and processes for 
regulating cellular function by manipulating signal transduction pathways in a cell. As such, there remains a need for 

25 products and processes that permit the implementation of predictable controls of signal transduction in cells, thus 
enabling the treatment of various diseases that are caused by abnormal cellular function. 

Such diseases may include cardiac diseases, which may include congestive heart failure (CHF), dilated 
congestive cardiomyopathy, hypertrophic cardiomyopathy, restrictive cardiomyopathy, mitral valve disease, aortic 
valve disease, tricuspid valve disease, angina pectoris, myocardial infarction, cardiac arrhythmia, pulmonary 

30 hypertension, arterial hypertension, renovascular hypertension, arteriosclerosis, atherosclerosis, and cardiac tumors. 
By way of example, CHF is a major cardiac disease associated with extensive morbidity and mortality. Approximately 
five million individuals in the United States suffer from some form of CHF. Traditionally, treatment of CHF occurs by a 
series of agents including diuretics, vasodilators, angiotensin converting enzyme inhibitors, B-adrenergic antagonists, 
and positive inotropes like digoxin. These drugs, however, principally provide symptomatic relief and typically only 

3 5 extend the life of one suffering from the disease for periods ranging from 6- 1 2 months. 
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The pathophysiology of CHF is rather complex. Generally, the central hallmark of the disease is the inability 
of the heart to pump sufficient oxygenated blood to meet the demands of peripheral tissues. Numerous etiologies 
contribute to the development of CHF, including primary diseases of, or insults to, the myocardium itself, cardiac 
defects, hypertension, inflammation, kidney disease and vascular disease. These conditions lead to the hypertrophy 
5 and remodeling of the cardiac ventricles which, if unchecked, ultimately reduce the mechanical performance of the 
heart. Forces associated with the inability of the heart to pump blood ultimately lead to the release of neurohormones 
like catecholamines, renin-angiotensin, aldosterone, endothelin and related factors into the circulation. Elevations in 
plasma levels of many of these circulating neurohormones have a deleterious impact on the outcome of patients with 
CHF. Local production of these neurohormonal factors in the heart is believed to contribute centrally to the disease. 
10 Thus, an important therapeutic strategy has been to block this neurohormonal axis contributing to the pathogenesis of 
this disease. 

Factors known to contribute centrally to the pathophysiology of heart disease are biosynthesized in the heart 
itself. These factors are produced in cardiac myocytes, fibroblasts, smooth muscle and endothelial cells, and 
inflammatory cells associated with the myocardium. For example, the heart contains its own renin-angiotensin system. 

15 Blockade of the cardiac renin-angiotensin system may contribute significantly to the therapeutic efficacy of the 
therapeutic class of agents known as angiotensin converting enzyme (ACE) inhibitors. 

The heart also produces other factors including endothelins, bradykinin, adrenomeduliin, tumor necrosis 
factor, transforming growth factors, and natriuretic peptides. Unfortunately, therapeutic strategies are limited to the 
modulation of such substances, which are already known to contribute to the disease. Indeed, the functional 

20 contributions of only a minor fraction of all known secreted factors encoded by the human genome have apparently 
been defined. 

The foregoing shows a need for methods and products involving the prevention or treatment of disease in 
mammals involving the mediation of signal transduction. The administration of polypeptides capable of regulating 
signal transduction, which preferably exhibit kinase activity, or antibodies against such polypeptides that inhibit the 
25 interaction of these polypeptides with other mediators of signal transduction, in addition to the identification or 
isolation of additional mediators of signal transduction associated with disease, preferably cardiac disease, which in 
turn may also be used in the identification, prevention or treatment of disease, preferably cardiac disease, in mammals, 
can facilitate such prevention or treatment. 

30 SUMMARY OF THE INVENTION 

An objective of the present invention is therefore the prevention or treatment of disease, preferably cardiac 
disease, in mammals through the administration of polypeptides capable of regulating signal transduction, which 
preferably exhibit kinase activity, or antibodies against such polypeptides that inhibit the interaction of these 
polypeptides with other mediators of signal transduction, in addition to the identification or isolation of additional 
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mediators of signal transduction associated with disease, preferably cardiac disease, which in turn may also be used 
in the identification, prevention or treatment of disease, preferably cardiac disease, in mammals. 

In accomplishing these and other objectives, the present invention preferably provides a purified polypeptide 
comprising the amino acid sequence of SEQ ID NOS: 1, 4, 6, 7, 8 or 9 wherein the polypeptide is capable of regulating 

5 signal transduction. In a preferred embodiment, the polypeptide is capable of catalyzing the transfer of a phosphate 
group from a donor molecule to an acceptor molecule. 

In another embodiment, the present invention preferably provides an isolated DNA molecule encoding a 
purified polypeptide comprising the amino acid sequence of SEQ ID NOS: 1, 4, 6, 7, 8 or 9 wherein the polypeptide is 
capable of regulating signal transduction. The present invention may also preferably be an isolated DNA molecule 

1 0 comprising the nucleotide sequence of SEQ ID NOS: 2, 3 or 5. 

In yet another embodiment, the present invention preferably provides a vector comprising a DNA molecule 
encoding a purified polypeptide comprising the amino acid sequence of SEQ ID NOS: 1, 4, 6, 7, 8 or 9 wherein the 
polypeptide is capable of regulating signal transduction. In another aspect, the present invention provides a host cell 
transformed with such a vector. In one other embodiment, the present invention may preferably provide the above- 

1 5 described transformed host cell, where the host cell produces a polypeptide capable of regulating signal transduction. 
In a preferred embodiment, the above-described transformed host cell produces a polypeptide capable of catalyzing the 
transfer of a phosphate group from a donor molecule to an acceptor molecule. 

In another preferred embodiment, the present invention provides an isolated antibody against a polypeptide 
comprising the amino acid sequence of SEQ ID NOS: 1, 4, 6, 7, 8 or 9 wherein the polypeptide is capable of regulating 

20 signal transduction. In a preferred embodiment, the antibody is a monoclonal antibody. In another preferred 
embodiment, the antibody is capable of inhibiting the regulation of signal transduction. In yet another preferred 
embodiment, the antibody is capable of inhibiting the transfer of a phosphate group from a donor molecule to an 
acceptor molecule. 

The present invention may also preferably provide an isolated nucleic acid capable of hybridizing under high 
25 stringency conditions to a DNA molecule comprising the nucleotide sequence of SEQ ID NOS: 2, 3 or 5. In a preferred 
embodiment, this isolated nucleic acid is capable of inhibiting the regulation of signal transduction. In yet another 
preferred embodiment, this isolated nucleic acid is capable of inhibiting said transfer of said phosphate group from said 
donor molecule to said acceptor molecule. 

In a preferred embodiment, the present invention provides a method of preventing or treating disease in a 
30 mammal comprising administering to said mammal an effective amount of material, selected from the group consisting 
of the polypeptide comprising the amino acid sequence of SEQ ID NOS: 1, 4, 6, 7, 8 or 9 wherein the polypeptide is 
capable of regulating signal transduction, and the antibody against this polypeptide, in a pharmaceutical^ acceptable 
sterile vehicle. In a preferred embodiment, the mammal may be a human. In another, the disease may be cardiac 
disease. 
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The present invention may also preferably provide a vaccine for preventing disease in a mammal comprising 
administering to said mammal an effective amount of material, selected from the group consisting of the polypeptide 
comprising the amino acid sequence of SEQ ID NOS: 1,4, 6, 7, 8 or 9 wherein the polypeptide is capable of regulating 
signal transduction, and the antibody against this polypeptide, in a pharmaceutical acceptable sterile vehicle. In a 
5 preferred embodiment, the mammal may be a human. In another, the disease may be cardiac disease. 

In a preferred embodiment, the present invention provides a method of preventing or treating disease in a 
mammal comprising administering to said mammal syngeneic cells transformed with a vector comprising a DNA 
molecule encoding a purified polypeptide comprising the amino acid sequence of SEQ ID NOS: 1 f 4, 6, 7, 8 or 9 
wherein the polypeptide is capable of regulating signal transduction, and wherein the transformed syngeneic cells 
10 produce a polypeptide capable of regulating signal transduction. In a preferred embodiment, the mammal may be a 
human. In another, the disease may be cardiac disease. 

The present invention may also preferably provide a method of preventing or treating disease in a mammal 
comprising administering to said mammal syngeneic cells transformed with a vector comprising a DNA molecule 
encoding a purified polypeptide comprising the amino acid sequence of SEQ ID NOS: 1, 4, 6, 7, 8 or 9 wherein the 
15 polypeptide is capable of regulating signal transduction, and wherein the transformed syngeneic cells produce a 
polypeptide capable of catalyzing the transfer of a phosphate group from a donor molecule to an acceptor molecule. In 
a preferred embodiment, the mammal may be a human. In another, the disease may be cardiac disease. 

In a preferred embodiment, the present invention may provide a kit for detecting the expression of a protein 
capable of regulating signal transduction, comprising a polypeptide, which comprises the amino acid sequence of SEQ 
20 ID NOS: 1, 4, 6, 7, 8 or 9 wherein the polypeptide is capable of regulating signal transduction. In a preferred 
embodiment, this kit further comprises a detectable label selected from the group consisting of colorimetric, 
enzymatic, chemiluminescent, fluorescent and radioactive labels. 

In another preferred embodiment, the present invention may provide a kit for detecting the expression of a 
protein capable of acting as a donor molecule or an acceptor molecule of a phosphate group comprising a polypeptide, 
25 which comprises the amino acid sequence of SEQ ID NOS: 1, 4, 6, 7, 8 or 9 wherein the polypeptide is capable of 
regulating signal transduction. In a preferred embodiment, this kit further comprises a detectable label selected from 
the group consisting of colorimetric, enzymatic, chemiluminescent, fluorescent and radioactive labels. 

The present invention may also preferably provide a method for detecting the expression of a protein capable 
of regulating signal transduction, comprising contacting a sample with a polypeptide, which comprises the amino acid 
30 sequence of SEQ ID NOS: 1, 4, 6, 7, 8 or 9 wherein the polypeptide is capable of regulating signal transduction, and 
detecting any effect of the sample on an indicator of signal transduction. In a preferred embodiment, the polypeptide 
is immobilized to a solid support. In another preferred embodiment, the phosphate group is detectably labeled. 

In another preferred embodiment, the present invention may provide a method for detecting the expression of 
a protein capable of acting as a donor molecules an acceptor molecule of a phosphate group, comprising contacting a 
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sample with a polypeptide, which comprises the amino acid sequence of SEQ ID NOS: 1, 4, 6, 7, 8 or 9 wherein the 
polypeptide is capable of regulating signal transduction, and detecting any effect of the sample on an indicator of 
signal transduction, and detecting any transfer of the phosphate group. In a preferred embodiment, the polypeptide is 
immobilized to a solid support. In another preferred embodiment, the phosphate group is detectably labeled. 
5 Other objectives, features, and advantages of the present invention will become apparent from the following 

detailed description. The detailed description and the specific examples, while indicating preferred embodiments of 
the invention, are provided by way of illustration only. Accordingly, the present invention also includes those various 
changes and modifications within the spirit and scope of the invention that may become apparent to those skilled in 
the art from this detailed description. 

10 

BRIEF DESCRIPTION OF THE FIGURES 
Figure 1 illustrates eight overlapping human cDNA clones of 1965. 

Figure 2 depicts homology alignment of amino acid sequences of human, rat and mouse clones corresponding 

to 19G5. 

15 Figure 3 depicts the likely gene structure of human 19G5 (H19G5), which reveals at least 10 exons and 9 

introns. 

Figure 4 is a schematic diagram of four cDNA clones corresponding to splicing variants of H19G5. The 
longest clone {C1 1) contains two kinase domains, the N-terminal and the Cterminal kinase domains (N-KD and C-KD). 
Figure 5 is a schematic diagram comparing the domain structure of 19G5 and Trio proteins. 
20 Figure 6 is the sequence comparison of the kinase domains of H19G5 (N-terminal and C-terminal kinase 

domains), Trio, and smooth muscle myosin light chain (SM MLCK). 

Figure 7 is the sequence comparison of the guanine nucleotide exchange factor (GEF) domains Df H19G5 and 

Trio. 

Figure 8 depicts a Northern blot analysis of various rat tissue samples using rat 1 9G5 DNA as a probe. Heart 
25 tissue reveals two transcripts (2 kb and a 4.4 kb) of R19G5. Multiple bands of weak hybridization are seen in skeletal 
muscle. Parallel Northern analysis of various rat tissue samples using a (3-actin probe suggests approximate equal 
input of mRNA in various samples and integrity of mRNA. 

Figure 9 depicts a Northern blot analysis of various human tissue samples using H19G5 DNA probe. Heart 
tissue reveals multiple transcripts (5 kb, 3 kb, 2.4 kb and 1.8 kb) derived from H19G5 gene. Skeletal muscle and brain 
30 tissues show multiple bands of weak hybridization. 

Figure 10 depicts a Southern blot analysis of human genomic DNA using H19G5 DNA as a probe (SEQ ID NO: 

2). 

Figure 1 1 shows the subcellular localization of 19G5-GFP fusion proteins in mouse myoblast cell line C2C12. 
Three 19G5-GFP fusion protein expression constructs were made using three different 19G5 cDNA clones, the longest 
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human 19G5 clone C11 [h19G5{Cl1)-GFP], a 2.7 kb clone of human 19G5 containing the C-terminal kinase domain 
[h19G59F1)-GFP], and the rat 19G5 small transcript [r19G5(S)-GFP]: The control GFP vector and the 19G5-GFP 
fusion expression constructs were transfected into C2C12 cells. The 19G5GFP fusion proteins' localization was 
detected using conf ocal microscopy. 
5 Figure 12 shows that H19G5 protein expressed by the clone C1 1 binds to the small G protein Cdc42. The 

lysate of 293 EBNA cells transfected with H19G5C11 expression construct was incubated with GST-Cdc42 
immobilized on glutathione-agarose. After washing, the complex was resuspended in SDS sample buffer, boiled and 
run on a SDS-PAGE, and Western blotted with an anti H19G5 monoclonal antibody. 

Figure 13 shows phase contrast micrographs of C2C12 myoblasts undergoing differentiation into myotubes. 
10 Undifferentiated C2C12 cells are shown in Day 0. Differentiation of C2C12, induced by placing in 2% horse serum, is 
apparent on Day 3 and Day 5 as shown. 

Figure 14 shows induction of myogenin protein expression when C2C12 myoblast cells are induced to 
differentiate into myotubes. Myogenin is a marker of myotubes. 

Figure 15 shows induction of 19G5 RNA expression during the differentiation of myoblasts into myotubes. 
1 5 Figure 1 6 shows inhibition of the induction of 1 9G5 expression by TFG-p during C2C12 differentiation. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
Those skilled in the art will recognize that the products and methods embodied in the present invention may 
be applied to a variety of systems, constructed with various materials using various methods. Accordingly, the 
20 present invention is not limited to any particular environment, and the following description of specific embodiments of 
the present invention are for illustrative purposes only. 

The present invention preferably provides methods for the prevention or treatment of disease, preferably 
cardiac disease, in mammals through the administration of polypeptides capable of regulating signal transduction, 
which preferably exhibit kinase activity, or antibodies against such polypeptides that inhibit the interaction of these 
25 polypeptides with other mediators of signal transduction, in addition to the identification or isolation of additional 
mediators of signal transduction associated with disease, preferably cardiac disease, which in turn may also be used 
in the identification, prevention or treatment of disease, preferably cardiac disease, in mammals. The cardiac diseases 
according to the present invention may include congestive heart failure (CHF), dilated congestive cardiomyopathy, 
hypertrophic cardiomyopathy, restrictive cardiomyopathy, mitral valve disease, aortic valve disease, tricuspid valve 
30 disease, angina pectoris, myocardial infarction, cardiac arrhythmia, pulmonary hypertension, arterial hypertension, 
renovascular hypertension, arteriosclerosis, atherosclerosis, and cardiac tumors. 

An embodiment of the invention is a purified polypeptide comprising an amino acid sequence having at least 
95% sequence identity with the amino acid sequence of SEQ ID NOS: 1, 4, 6, 7, 8 or 9. As used herein, polypeptide 
refers to a linear series of amino acid residues connected to one another by peptide bonds between the alpha-amino 
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groups and carboxy groups of adjacent amino acid residues. Additional covalent bonds between portions of the 
peptide may also be present to restrain the conformation of the molecule, such as amide and disulfide bonds. When 
used herein, polypeptide also refers to a linear series of amino acid residues connected one to the other as in a 
peptide. The term synthetic peptide means a chemically derived chain of amino acid residues linked together by 

5 peptide bonds that is free of naturally occurring proteins and fragments thereof. 

The one and three-letter symbols used to represent the amino acid residues in the polypeptides of the 
present invention are those symbols commonly used in the art. The amino acid residues are preferred to be in the L 
isomeric form. However, residues in the D isomeric form may be substituted for any L-amino acid, as long as the 
desired functional property of signal transduction mediation is retained by the peptide. The one and three-letter 

10 symbols used herein refer to the following amino acids: Ser (S) is serine; lie (I) is isoleucine; Gin |Q) is glutamine; Phe 
{F) is phenylalanine; His |H) is histidine; Trp (W) is tryptophan; Lys (K) is lysine; Asn (N) is asparagine; Leu (U is leucine; 
Gly (G) is glycine; Thr (T) is threonine; Asp (D) is aspartic acid; Arg (R)is arginine; and Ala (A) is alanine. 

Polypeptides of the present invention include variants, fragments and chemical derivatives of the 
polypeptides comprising the amino acid sequence of SEQ ID NOS: 1, 4, 6, 7, 8 or 9 as long as they are capable of 

15 mediating signal transduction. Polypeptides thus may include soluble peptides, Ig-tailed fusion peptides (including 
immunoadhesions), members of random peptide libraries [see, e.g., Lam, K.S. et a/., Nature 354:82-84 (1991); 
Houghten, R. et at., Nature 354:84-86 (1991)), combinatorial chemistry-derived molecular libraries made of D-and/or L- 
configuration amino acids, and phosphopeptides (including members of random or partially degenerate, directed 
phosphopeptide libraries, see, e.g., Songyang, Z. et aL, Cell 72:767-778(1993)). 

20 Polypeptides of the present invention may also include polypeptides that can be isolated from nature or can 

be produced by recombinant and/or synthetic means. Such native sequence polypeptides specifically refers to 
naturally-occurring truncated or secreted forms [e.g., an extracellular domain sequence), as well as naturally occurring 
variant forms (e.g., alternatively spliced forms), and naturally occurring allelic variants of the named polypeptides. 

The term variant refers to any polypeptide having an amino acid sequence, in comparison to the amino acid 

25 sequences of the polypeptides of the present invention, in which one or more amino acids have been substituted with 
other amino acids; where the substituted amino acids allow or require the polypeptide to assume the equilibrium 
conformation of the domain of the parent protein. Often, cysteine, lysine and glutamic acid will be used for their side 
chains which can form covalent linkages to restrict the conformation of a peptide. The term variant refers to any 
polypeptide in which one or more amino acids are added and/or substituted and/or deleted and/or inserted at the N- or 

30 C-terminus or anywhere within the corresponding native sequence, and which retains signal transduction mediation 
activity of the corresponding native polypeptide. The variants herein preferably comprise a sequence that has at least 
about 80% sequence identity, more preferably at least about 85% sequence identity, even more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity, with the amino acid sequence of 
SEQ ID NOS: 1, 4, 6, 7, 8 or 9. 
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In such amino acid sequences, one or more amino acids in the fundamental sequence may preferably be 
substituted with another amino acid(s), the charge and polarity of which is similar to that of the native amino acid, 
i.e., a conservative amino acid substitution, resulting in a silent change. Substitutes for an amino acid within the 
fundamental polypeptide sequence can be selected from other members of the class to which the naturally occurring 

5 amino acid belongs. Amino acids can be divided into the following four groups: (1) acidic amino acids; (2) basic amino 
acids; 13) neutral polar amino acids; and (4) neutral non-polar amino acids. Representative amino acids within these 
various groups include, but are not limited to: (1) acidic (negatively charged) amino acids such as aspartic acid and 
glutamic acid; (2) basic (positively charged) amino acids such as arginine, histidine, and lysine; (3) neutral polar amino 
acids such as glycine, serine, threonine, cyteine, cystine, tyrosine, asparagine, and glutamine; (4) neutral nonpolar 

10 (hydrophobic) amino acids such as alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and 
methionine. 

The term variant shall also include any polypeptide having one or more amino acids deleted from or added to 
an amino acid sequence of a mediator of signal transduction, but which still retains signal transduction mediation 
activity. The term fragment shall refer to any shorter version of the polypeptides herein, wherein the fragment is 
1 5 capable of mediating signal transduction. 

Sequence identity is defined as the percentage of amino acid residues in a candidate sequence that are 
identical with the amino acid residues in a native polypeptide sequence, after aligning the sequences and introducing 
gaps, if necessary, to achieve the maximum percent sequence identity, and not considering any conservative 
substitutions as part of the sequence identity. The % sequence identity values are preferably generated by the NCBI 
20 BLAST2.0 software as defined by Altschul et al., (1997), "Gapped BLAST and PSI-BLAST: a new generation of 
protein database search programs", Nucleic Acids Res., 25:3389-3402. The parameters are set to default values, 
with the exception of the Penalty for mismatch, which is set to -1. Other algorithms, such as GAP, BESTFIT, 
FASTA, and TFASTA in the Wisconsin Genetics Software Package (Genetics Computer Groups, 575 Science Dr., 
Madison, Wl), are also suitable. The selection of the non-default parameters to achieve maximum sequence identity is 
25 well within the skill of a person skilled in the art. 

Antibodies of the present invention may include any polyclonal, monoclonal, humanized, anti-idiotypic, 
chimeric or single chain antibodies, and FAb, F(ab') 2 and FAb fragments, and epitope-binding fragments thereof. 

Without further elaboration, one skilled in the art with the preceding description can utilize the present 
invention to its fullest extent. The following examples are illustrative only, and not intended to limit the remainder of 
30 the disclosure in any way. 

Example 1 1solation and characterization of 19G5 clones from rat, mouse and human 

isolation of partial and full-length 19G5 clones from rat, mouse and human 
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Microarray technology was used to identify genes that are differentially expressed in normal and diseased 
rat heart. A sequence designated as 1965 was down regulated in 12-week myocardial infarct (Ml) rat hearts. A rat 
cDNA clone corresponding to 19G5 (R19G5) was isolated and nucleotide sequence determined. The deduced amino 
acid sequence of the clone revealed homology to the catalytic domain of kinases, thus suggesting that the protein 
5 product of R19G5 might be involved in signal transduction. A hybridization of multiple tissue Northern blot with the 
R19G5 probe showed that the gene is highly expressed in heart as a 2 kb and 4.4 kb transcripts (Figure 8). 

A full length cDNA for the 2 kb R19G5 transcript was cloned using 5' RACE technique. The R19G5 has an 
open reading frame of 1644 base pairs which encodes a protein of 548 amino acids (SEQ ID NO: 8) 

A mouse 19G5 (M19G5) EST clone was identified by searching the EST database. Northern blot 
10 hybridization using the M19G5 EST clone as probe detected a major transcript of 2 kb in heart. There is also a low 
level of expression in lung. The smeary hybridization was also detected as in the rat and human but was less 
prominent. M19G5 was also expressed in 17-day old mouse embryos suggesting that it may play a role in embryo 
development. Sequence analysis of the M19G5 clone showed that it is 1900 base pairs long and has an open reading 
frame of 1644 base pairs coding for a polypeptide of 548 amino acids (SEQ ID NO: 9), suggesting that it likely 
1 5 represents full length 2 kb transcript. 

A number of human cDNA clones corresponding to 19G5 (H19G5) were isolated from human cONA library 
using R19G5 as a probe. Figure 1 shows eight overlapping cDNA clones of H19G5. Extensive overlap among these 
clones helped build a consensus nucleotide sequence (SEQ ID NO: 2) designated as H19G5 contig. The sequence 
corresponds to a major transcript ("5 kb) expressed in human heart. The sequence of the contig revealed that it is 
20 complete at the 3'-end since it contains a polyadenyiation signal (AATAAA) as well as polyA residues at the 3'-end. It 
has a potential open reading frame, coding for 1351 amino acid residues, that extends beyond the 5'-end of the contig 
indicating that it is incomplete at the 5'-end. The deduced amino acid sequence (SEQ ID NO: 1) revealed a protein 
kinase domain at the Cterminal end (amino acid residues 1056 to 1309) and also a partial protein kinase domain 
towards the N-terminal end (amino acid residues 1 to 105) of this truncated clone. The sequence information of this 
25 contig (SEQ ID NO: 2) was used to devise antisense primers corresponding to the 5'-end, which were used in 5' RACE 
(rapid amplification of cDNA ends) to isolate cDNA clones with longer inserts. A cDNA clone containing full-length 
coding sequence was isolated. The nucleotide sequence of the clone (SEQ ID NO: 3) revealed an open reading frame 
that could potentially code for 1667 amino acids long full-length polypeptide (SEQ ID NO: 4). 

Using the 5' RACE technique, the full coding region for one of the larger H19G5 (3 kb or 5 kb) transcripts 
30 was cloned. It has an open reading frame of 2613 base pairs encoding a protein of 81 1 amino acids. 

Figure 2 shows the amino acid sequence alignment of the three full-length 19G5 proteins. Both the R19G5 
(SEQ ID NO: 8) and M19G5 (SEQ ID NO: 9) proteins overlap with the C-terminal two-third of a splicing variant of the 
H19G5 protein (SEQ ID NO: 7). The identity between the R19G5 and M19G5 proteins is 97%. The three proteins are 
85% identical in their sequences. 
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In order to understand the genomic organization of H19G5 gene, phage genomic library was screened and 
eleven H19G5 genomic clones were isolated. Restriction enzyme mapping of these clones detected no polymorphism, 
suggesting that H19G5 may be a single copy gene. The entire genomic DNA region encompassing H19G5 was 
sequenced and found to contain at least 10 exons and 9 introns as shown in Figure 3. 
5 One of the H19G5 cDNA clones contained deletion of two amino acid residues (alanine and proline) in the 

kinase domain. Sequence analysis of the genomic H19G5 DNA revealed an intron located immediately upstream of the 
alanine residue. There are two adjacent splicing acceptor sites that are four nucleotides apart at the 3' end of the 
intron. A splicing event using the first acceptor site generates a protein that includes the two residues, alanine and 
proline. Utilization of the second splicing acceptor site creates a H19G5 protein with the two amino acids deleted in 
10 its kinase domain. These two residues are highly conserved among all kinases. This deletion form of H19G5 protein 
may thus exhibit reduced or no kinase activity at all, and may serve as a dominant negative inhibitor of the kinase 
activity of H19G5. This provides a possible mechanism to regulate the activity of H1 9G5 in vivo. 

Isolation of cDNA clones of splicing variants of H19G5 

15 Multiple transcripts were detected in Northern analysis of human and rat cells, particularly in heart tissues 

(As shown in Figs. 8 and 9). However, the restriction analysis of the genomic clones and Southern analysis of genomic 
DNA suggested single-copy nature of H19G5 gene. These observations indicated the possibility of alternative splicing 
as a source of multiple transcripts. Four cDNA clones representing various splicing variants were isolated and are 
schematically shown in Figure 4. Complete nucleotide sequence (SEQ ID NO: 5) of the longest clone C11 was 

20 determined and the amino acid sequence of a large open reading frame contained therein was deduced (SEQ ID NO: 6). 
The large ORF has a potential to encode a protein of 2596 amino acid residues. The sequence analysis revealed the 
presence of a number of structurally and functionally important domains in H19G5. The presence of some of the 
domains strengthens the possibility of the involvement of H19G5 in signal transduction. For example, H19G5 has two 
kinase domains, one at the N-terminal (1094 to 1351 amino acid residues, N-KD) and the other at the C-terminal 

25 (2301 to 2553 amino acid residues, C-KD) end. H19G5 also has a Guanine nucleotide Exchange Factor (GEF) domain 
(325 to 504 amino acid residues), and a Pleckstrin Homology (PH) domain (532 to 634 amino acid residues). 
Additionally, H19G5 contains five Immunoglobulin (Ig) like domains distributed throughout the sequence. 

Structural and functional features of full-length H19G5 protein sequence 
30 The full-length H19G5 amino acid sequence (SEQ ID NO: 6) shows sequence similarity to some functionally 

important domains of a protein called Trio. The comparison of the domain structure of H19G5 and Trio is 
schematically depicted in Figure 5. Both kinase domains of HI 965 (N-KD and C-KD) are homologous to a single kinase 
domain of Trio as well as to a kinase domain of smooth muscle myosin light chain kinase (SM MLCK) as shown in 
Figure 6. The identity between H19G5 N-KD and Trio KD is about 40% and between H19G5 N-KD and SM MLCK is 
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about 38%. The sequence identity between the two kinase domains of H19G5 is about 30%. A single GEF domain of 
H19G5 is homologous to both GEF domains (GEF-D1 and GEF-D2) of Trio as shown in Figure 7. 

Proteins containing GEF domains are involved in signal transduction (for a review, see Cherfils and Chardin, 
Trends Biochem. Sci. 24: 306-311 [1999]). GEF domains promote exchange of GTP for GDP on GTP-binding proteins 

5 (G proteins) and thereby positively regulate their activities. As these proteins harbor intrinsic GTPase activity, they are 
also referred to as GTPases. These small G proteins, as opposed to trimeric G proteins, belong to a superfamily of Ras- 
like proteins. These proteins are bound to the inner face of plasma membrane, and usually exist in GDP-bound 
"inactive" state. When a ligand interacts with a membrane bound receptor, alteration of conformation allows the 
receptor to interact with a G protein. This interaction results in conformational change in the G protein that weakens 

10 the affinity for GDP and leads to replacement of GDP with GTP. This nucleotide exchange is greatly accelerated or 
promoted by proteins containing GEF domains. Once bound to GTP, the G proteins assume an "active" state in which 
they interact with the downstream effectors and facilitate transduction of signal from membrane to the nucleus. 
However, the activity of the G proteins is tightly controlled as their intrinsic GTPase activity rapidly hydrolyzes bound 
GTP into GDP and restores "inactive" status. Thus, G proteins function as molecular switches in signal transduction. 

1 5 A number of membrane receptors operate through G proteins. The downstream effectors of the activated G proteins 
include various protein kinases constituting a cascade of protein phosphorylation that brings about a desired change in 
gene expression. 

As discussed in the preceding section, GEF domains play a critical role in signal transduction by controlling 
the activation of G proteins. Trio is a complex protein possessing two GEF domains, each with adjacent pleckstrin 

20 homology (PH) domains and Src Homology-3 (SH3) domains, a protein serine/threonine kinase domain with an adjacent 
immunoglobulin-like domain and multiple spectrin-like domains (Medley et ah, J. Cell Sci. 112: 1825-1834 [1999]). 
Trio cDNA clone was isolated by virtue of its ability to interact with protein tyrosine phosphatase (PTP) domain of a 
protein called LAR (Debant et al., Proc. Natl. Acad. Sci. USA 93: 5466-5471 11996]). LAR is a broadly expressed 
transmembrane protein tyrosine phosphatase comprised of a cell adhesion like extracellular region and two 

25 intracellular PTPase domains, and is proposed to regulate cell-matrix interactions (Mourey and Dixon, Curr. Qpin. Gen. 
Dev. 4: 31-39 [1994]). Trio represents a unique member of the Rho-GEFs family possessing two functional GEF 
domains of distinct specificities. For example, GEF1 is specifically active on Rad GTPase, while GEF2 targets RhoA 
GTPase (Debant et al, supra). This unique feature allows Trio to link Rho and Rac specific signalling pathways in vivo. 
The Rho family of Ras-Jike GTPases includes Rac (1, 2 and 3), RhoG, Cdc42Hs, TC10, TTF/RhoH, Rho (A, B 

30 and C), RhoD, RhoE, and RhoL These proteins and other Ras-like proteins constitute Ras superfamily of structurally 
and functionally related GTPase proteins. These proteins are involved in diverse physiological functions such as control 
of cell shape (reviewed in Tapon and Hall, Curr. Qpin. Cell Biol. 9: 86-92 [1997]), cell motility (Aepfelbacher et al, 
Proc. Natl. Acad. Sci. USA 91: 4263-4267 [1994]; and Curr. Biol. 6: 70-75 [1996]), cell polarity (Adams et al, J. Cell 
Biol. 111: 131-142 (1990]), smooth muscle contraction (Hirata et al, J. Biol. Chem. 267: 8719-8722 119921), cell 
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adhesion (Nobes and Hall, Cell 81: 53 62 [1995]; Braga et al., J. Cell Biol. 137: 1421-1431 [1997]), cell division 
(Dutartre et al., J. Cell Set. 109:367-377 [1996]), vesicular transport between organelles such as receptor-mediated 
endocytosis (Lamaze etal., Nature 382: 177-179 [1996]), apoptosis (Esteve et al.,Oncogene 11: 2657-2665 [1995]; 
Jimenez et al.. Oncogene 10: 811-816 [1995]; Gulbins etal, J. Biol. Chem. 271: 26389-26394 [1996]; Moorman et 
5 al., J. Immunol. 156: 4146-4153 [1996]; Brenner et al., J. Biol. Chem. 272: 22173-22181 [1997]) and normal and 
pathological cell proliferation (Olson et al., Science 269: 1270-1272 [1995); Hirai et al., J. Biol. Chem. 272: 13-16 
[1997]; Khosravi-Far et al., Mol.Cell. Biol. 16: 3923-3933 [1996]; Qiu et al., Mot. Cell. Biol. 17: 3449-3458 [1997]; 
Roux etal, Curr. Biol. 7: 629-637 [1997]). 

The presence of a GEF domain and protein kinase domains along with its homology to Trio suggests that 
10 H1 9G5 may possess guanine nucleotide exchange factor activity and protein kinase activity, both of which are shared 
by a number of proteins involved in signal transduction. 

Expression pattern of 19G5 in rat and human tissues 

Northern blot analysis revealed that R19G5 gene is highly expressed in heart as a 2 kb and a 4.4 kb 

1 5 transcripts (Figure 8). Hybridization was performed using multiple tissue Northern blot (Clontech, Palo Alto, CA) and 
ExpressHyb solution following the manufacturer's protocol. The R19G5 probe also hybridized to mRNAs from skeletal 
muscle and detected multiple weak bands. High background observed in a lane corresponding to skeletal muscles is 
not due to RNA degradation as probing of the same blot with (3-actin probe detected the right sized transcripts with a 
clean background (Figure 8). The significance of the high background in skeletal muscle is not clear. Expression of 

20 R1 9G5 was not detected in brain, kidney, spleen, lung, liver, and testis. 

Hybridization of human multiple tissue Northern blot (Clontech, Palo Alto, CA) with H19G5 probe detected a 
strong transcript of about 5 kb and three minor transcripts of about 3 kb, 2.4 kb, and 1.8 kb in heart tissue (Figure 9). 
There were multiple transcripts and high background in skeletal muscle as seen with rat tissue. Only the 3 kb 
transcript was detected in human fetal heart. There is a low level expression of the 2.4 kb transcript in brain. H19G5 

25 expression was not detected in spleen, lung, liver, kidney, pancreas, thymus, prostates, testis, ovary, small intestine, 
colon, peripheral blood leukocyte, stomach, thyroid, spinal cord, lymph node, trachea, adrenal gland, bone marrow, 
uterus muscle, or bladder muscle by Northern blot. 

In order to determine whether the multiple transcripts of 19G5 in human heart are derived from alternative 
splicing or multiple gene copies, Southern blot analysis of genomic DNA was performed with H19G5 probe (Figure 10). 

30 The results suggest that H1 9G5 is a single copy gene. This is consistent with the lack of detection of polymorphism in 
the restriction analysis of the genomic clones of HI 9G5. 

Functional characterization of H19G5 protein 
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A 20 amino acid peptide from the C-terminus of H19G5 protein (SEQ ID NO: 1) was used to raise polyclonal 
antibodies in rabbits. Affinity purified rabbit anti-H19G5 antiserum has been obtained. Two different GST-H19G5 
fusion proteins were used as antigens to generate mouse monoclonal anti-H19G5 antibodies. One of the antigens used 
was a GST-H19G5 fusion protein containing amino acid residues 610 to 811 of SEQ ID NO: 1. Many clones of ami- 

5 19G5 monoclonal antibodies were obtained. Three of them were shown to recognize recombinant H19G5 proteins 
expressed in mammalian cells by Western analysis and immunocytochemistry. The antibodies may be used to determine 
the size and localization of 19G5 protein by Western blot and immunohistochemistry. 

Subcellular localization of H19G5 protein was determined using confocal microscopy on cells transfected 
with vectors expressing 19G5 proteins fused to Green Fluorescent Protein (GFP). Three 19G5-GFP fusion protein 

10 expression constructs were made using three different 19G5 cDNA clones, the longest human 19G5 clone C11 
[H19G5(C11)GFP], a 2.7 kb clone of human 19G5 containing the C-terminal kinase domain [H19G59FU-GFP], and the 
rat 19G5 small transcript [R1965(S)-GFP]. The control GFP vector and the 19G5-GFP fusion expression constructs 
were transfected into C2C12 cells (ATCC Catalog No. CRM 772). Cells were grown on chamber slides and transfected 
using SuperFectamine reagent from Quiagen for 3 hrs. Cells were fixed 24 hrs post-transfection with 4% 

15 paraformaldehyde for 15 min at room temperature and examined using a confocal microscope. The two longer forms 
of human 19G5-GFP proteins were detected in the nuclei whereas the short rat 19G5-GFP fusion protein was detected 
in the cytoplasm (Figure 11). This suggests that different forms of 19G5 proteins are localized in different regions in 
the cells and may have different functions. Since the gene is normally expressed in heart, the localization of 19G5 
protein in cardiac myocytes may be examined. Antisense constructs to study the effects of inhibiting the activity of 

20 1 9G5 protein on cardiac myocytes may also be made and used. 

A possibility that H19G5 might interact with various G proteins, as suggested by the presence of GEF 
domain and homology with Trio, was examined. HEK 293 cells constitutively expressing Epstein-Barr Virus Nuclear 
Antigen (EBNA) (Invitrogen, San Diego, CA) were transfected with H19G5-C11 expression construct. The eukaryotic 
vector used for 19G5 expression, pEAK8 (EdgeBiosystems), contains Epstein-Barr virus (EBV) origin of DNA 

25 replication, which allows replication of the expression construct in transfected cells thereby amplifying the level of 
protein expression. After 24 hrs, one 10 cm dish of transfected 293 EBNA cells were lysed with 1 ml of lysis buffer 
(IX PBS, 0.1% Triton, and proteinase inhibitors (0.2 mM AEBSF, 0.16 Aprotinin, 0.01 mM Bestatin, 3 E-64, 
4 |iM Leupeptin, and 2 jiM Pepstatin) on ice for 30'. Cells were then homogenized with a dounce homogenizer on ice. 
The lysates were cleared by centrifugation. One ml of the cleared lysate was incubated with 5 ng of various small G 

30 proteins, expressed as GST fusion proteins and bound to glutathione-agarose beads, at 4° C for 2 hrs. The protein- 
agarose complex was pelleted by brief centrifugation and washed for 4 times 5' each with the lysis buffer at room 
temperature. The complex was then resuspended in SDS sample buffer, boiled and run on a SOS-PAGE and Western 
blotted with an anti-H19G5 monoclonal antibody. As shown in Figure 12, H19G5 was found to bind to Cdc42. No 
binding was detected with Rad or RhoA. Lane 8 is His-tagged Cdc42 and in lane 9 His-tagged Cdc42-agarose 
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complex was boiled for 5' before adding to the H19G5 cell lysate. This result showed that H19G5 specifically 
interacts with Cdc42 protein and there is no non-specific interaction with agarose beads. The results presented herein 
suggest that 1 9G5 protein may play an important role, by virtue of regulating a small GTPase such as Cdc42Hs, in a 
variety of cellular activities. For example, Cdc42 has been shown to regulate actin polymerization and focal adhesion 
5 complex formation which in turn is necessary for filopedia formation (Nobes and Hail, ft// 81: 53-62 [1995]). Cdc42 
and rac have also been shown to regulate Jun N-terminal kinase (JNK) activity via the MAP kinase pathway (Coso et 
a!., &?//81: 1137 1146 (1995); Minden et al., ft?// 81: 1147 1157 [1995]; Olson et al., Science 269: 1270-1272 
[1995]), an evolutionarily conserved and ubiquitous signal transduction pathway that impacts upon a number of 
important cellular functions. 

10 C2C12 myoblasts cells (ATCC Catalog No. CRM 772) can be induced to differentiate into myotubes when 

placed in a medium containing 2% horse serum (Lechner et al, Proc. Natl Acad. ScL USA 93: 4355-4359 [1996]). 
Figure 13 shows phase contrast micrographs of C2C12 myoblasts undergoing differentiation into myotubes. At Day 
0, undifferentiated C2C12 cells with typical myoblast morphology can be seen. Once induced to differentiate, as 
shown here at Day 3 and 5 after induction, an increasingly larger number of cells with typical morphology of 

1 5 differentiated myotubes, i.e. large, elongated, multinucleated syncytial cells, could be seen. The induction of Myogenin 
expression was monitored during differentiation. Myogenin is not expressed in myoblasts, however, its expression is 
strongly induced when myoblasts undergo differentiation into myotubes. Thus, it acts as a biochemical marker of 
myotubes. Figure 14 shows induction of myogenin protein expression when C2C12 myoblast cells are induced to 
differentiate into myotubes. C2C12 cells were cultured and induced to differentiate by placing in a medium containing 

20 2% horse serum. Cells were lysed in radioimmunoprecipitation (RIPA) buffer (1X PBS containing 1% Igepal CA-630, 
0.5% sodium deoxycholate and 0.1% SDS) on ice for 30'. Total lysates were cleared by centrifugation at 10,000 rpm 
for 10'. Protein concentration of each lysate was measure using the BCA method. SDS sample buffer was added to 
the total lysate and boiled for 3'. Equal amount of total protein of each sample was run on SDS-PAGE and blotted 
using an anti-myogenin monoclonal antibody. The result demonstrated that Myogenin expression was induced 

25 concomitantly with differentiation of C2C12 myoblasts into myotubes under the conditions used for induction. 

The expression of 19G5 RNA was also monitored at various stages during differentiation of myoblasts into 
myotubes. Undifferentiated C2C12 cells were plated on 6 cm dishes at 2.4 x 10 4 cells/cm 2 and cultured in growth 
medium with 10% fetal bovine serum for 24 hrs to about confluence. Cells were washed with PBS and induced to 
differentiate into myotubes in differentiation medium with 2% horse serum. Total RNA was isolated from cells at 1, 

30 2, or 4 days post-induction using Qiagen's Rneasy kit according to the manufacture's instruction. The expression of 
19G5 transcript was analyzed using Taqman assay. One microgram of total RNA, isolated at various time points post- 
induction, was reverse transcribed into cDNA using PE Biosystems Reagents and Multiscribe enzyme according to 
manufacture's instruction. Ten ng of cDNA was added to 1X master mix, and the primers and probe for the gene of 
interest were added according to manufacture's instructions. The reaction were carried out in the ABI Prism 7700 
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Detection System. The quantity of 19G5 and the quantity for 18S were determined for each sample, and the ratio of 
19G5/ 18S was used to evaluate differences in the level of 19G5 expression in various samples. Comparison of the 
values thus obtained with pre-induction values allowed to determine fold induction of 19G5 expression during 
differentiation. As shown in Figure 15, expression of 19G5 transcript was significantly increased during differentiation 

5 of C2C12 myoblasts into myotubes. The level of induction reached to about 10-fold at 4 days post-induction, 
increased expression of 19G5 likely reflects a specific function in myotubes. 

TGF-P is known to inhibit differentiation of C2C12 myoblast into myotubes (Katagiri etal., J. Cell Biol 127: 
1755-1766 [1994]; Namiki et aL, J. Biol. Chem. 272: 22046-22052 [1997]). Therefore, the effect of TGF-P on the 
induction of expression of 19G5 during differentiation of C2C12 cells was examined. C2C12 cells were plated in 6 cm 

10 dishes at 2.4 x 10 4 cells/cm 2 and cultured in the growth medium for 24 hours. Cells were then rinsed with PBS and 
induced to differentiate for 4 days in the medium containing 5% fetal bovine serum either in the absence or in the 
presence of 10 ng/ml of TGF-p. Total RNA was isolated from cells using Qiagen's Rneasy kit. Induction of 19G5 
expression in TGF-p treated or untreated cells over undifferentiated C2C12 cells was measured by Taqman assay. As 
shown in Figure 16, the induction of 19G5 expression during C2C12 differentiation is inhibited by TFG-p. TGF-P is a 

15 powerful regulator of cell growth and differentiation and regulation of expression of 19G5 by TGF-P likely represents 
an important physiological event with significant relevance to normal and abnormal changes in cardiac cells. 
Furthermore, the results suggest a possible inolvement of 19G5 in myogenesis. 

Example 2 (Polypeptides which can mediate signal transduction! 

20 The polypeptides of the present invention, such as the specific embodiment shown in SEQ ID NOs: 1, 4, 6, 7, 

8 or 9 may be prepared by any known techniques. Conveniently, the polypeptides may be prepared using the solid- 
phase synthetic technique initially described by Merrifield in J. Am. Chem. Soc. 15:2149-2154 (1963). Other peptide 
synthesis techniques may be found, for example, in M. Bodanszky et aL, PEPTIDE Synthesis, John Wiley & Sons, 2d 
Ed. (1976) as well as in other reference works known to those skilled in the art. A summary of peptide synthesis 

25 techniques may be found in J. Stuart and J.D. Young, Solid Phase Peptide Synthelia, Pierce Chemical Co., Rockford, 
IL (1984). The synthesis of peptides by solution methods may also be used, as described in The Proteins, Vol-ll, 3d 
Ed., Neurath, H. etal.. Eds., p.105-237. Academic Press, New York, NY (1976). Appropriate protective groups for use 
in such syntheses will be found in the above texts as well as in J.F.W. McOmie, Protective Groups in Organic 
Chemistry, Plenum Press, New York, NY (1973). In general, these synthetic methods involve the sequential addition 

30 of one or more amino acid residues or suitably protected amino acid residues to a growing peptide chain. Normally, 
either the amino or carboxyl group of the first amino acid residue is protected by a suitable, selectively-removable 
protecting group. A different, selectively-removable protecting group is utilized for amino acids containing a reactive 
side group, such as lysine. 
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Using a solid phase synthesis as an example, the protected or derivatized amino acid is attached to an inert 
solid support through its unprotected carboxyl or amino group. The protecting group of the amino or carboxyl group is 
then selectively removed and the next amino acid in the sequence having the complementary (amino or carboxyl) group 
suitably protected is admixed and reacted under conditions suitable for forming the amide linkage with the residue 
5 already attached to the solid support. The protecting group of the amino or carboxyl group is then removed from this 
newly added amino acid residue, and the next amino acid (suitably protected) is then added, and so forth. After all the 
desired amino acids have been linked in the proper sequence, any remaining terminal and side group protecting groups 
(and solid support) are removed sequentially or concurrently, to provide the final peptide. The polypeptides of the 
invention preferably are devoid of benzylated or methylbenzylated amino acids. Such protecting group moieties may 
10 be used in the course of synthesis, but they are removed before the polypeptides are used. Additional reactions may 
be necessary, as described elsewhere to form intramolecular linkages to restrain conformation, if desired. The 
polypeptides of the present invention may also be linked to an additional sequence of amino acids either or both at the 
Nterminus and at the C-terminus. Such additional amino acid sequences, or linker sequences, can be conveniently 
affixed to a detectable label, solid matrix, or carrier. Typical amino acid residues used for linking are tyrosine, 
1 5 cysteine, lysine, glutamic acid and aspartic acid, or the like. 

Of course, the present polypeptides may also be prepared by recombinant DNA techniques as described, for 
example, in Sambrook, etaL, Molecular Cloning: A Laboratory Manual, Chapters 1-18, Second Edition (Cold Spring 
Harbor NY 1989), and as detailed in Examples 34 infra. The present invention also relates to vectors comprising DNA 
molecules of the present invention, host cells which are genetically engineered with vectors of the invention and the 
20 production of polypeptides of the invention by recombinant techniques. Host cells may be genetically engineered 
(transduced or transformed or transfected) with the vectors of this invention which may be, for example, a cloning 
vector or an expression vector. The vector may be, for example, in the form of a plasmid, a viral particle, a phage, etc. 
The engineered host cells can be cultured in conventional nutrient media modified as appropriate for activating 
promoters, selecting transformants or amplifying genes. The culture conditions, such as temperature, pH and the like, 
25 are preferably those previously used with the host cell selected for expression, and will be apparent to the skilled 
artisan. 

Example 3 (Hosts, Vectors and DNA encoding polypeptides which can mediate signal transduction) 
The DNA molecules of the present invention may be employed for producing the polypeptides of the present 
30 invention by recombinant techniques. More particularly, the present invention also includes recombinant constructs 
comprising one or more of the sequences broadly described above. Thus, for example, the DNA molecule sequence 
may be included in any one of a variety of expression vehicles, in particular vectors or plasmids for expressing such a 
polypeptide. Such vectors include chromosomal, nonchromosomal and synthetic DNA sequences, e.g., derivatives of 
SV40; bacterial plasmids; phage DNA; yeast plasmids; vectors derived from combinations of plasmids and phage DNA; 
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viral DNA such as vaccinia, adenovirus, fowl pox virus, and pseudorabies. The following vectors are provided by way 
of example. Bacterial: pQE70, pQE60, pQE-9 (Qiagen), pBS, phagescript, psiX174, pBluescript SK, pBsKS, pNH8a, 
pNH16a, pNH18a, pNH46a (Stratagene); pTRC99A, pKK223-3, pKK233-3, pDR540, PRIT5 (Pharmacia). Eukaryotic: 
pWLneo, pSV2cat, pOG44, pXT1, pSG (Stratagene); pSVK3, pBPV, pMSG, PSVL (Pharmacia). However, any other 
5 vector or plasmid may be used as long as they are replicabie and viable in the host. The vector containing the 
appropriate DNA sequence, as well as an appropriate promoter or control sequence, may be employed to transform an 
appropriate host to permit the host to express the polypeptides of the present invention. Representative examples of 
appropriate hosts include: bacterial cells, such as £ coli, Salmonella typhimuritim, Streptomyces\ fungal cells, such as 
yeast; insect cells, such as Drosophila S2 and Spodoptera Sf 9; animal ceils, such as CHO, COS or Bowes melanoma; 
1 0 adenoviruses; plant cells, etc. The selection of an appropriate host is deemed to be within the scope of those skilled in 
the art. 

The appropriate DNA sequence may be inserted into the vector by a variety of procedures. In general, the 
DNA sequence is inserted into an appropriate restriction endonuclease site by procedures known in the art. Such 
procedures and others are deemed to be within the scope of those skilled in the art. 

15 The DNA sequence in the expression vector may be operatively linked to an appropriate expression control 

sequence(s) (promoter) to direct mRNA synthesis. Suitable promoter regions can be selected from any desired gene 
using CAT (chloramphenicol transferase) vectors or other vectors with selectable markers. Two appropriate vectors 
are pKK232-8 and pCM7. Particular named bacterial promoters include laci, lacZ, T3, T7, gpt, lambda P R , P L and trp. 
Eukaryotic promoters include CMV immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, 

20 and mouse metallothionein l. Selection of the appropriate vector and promoter is well within the level of skill in the 
art. The expression vector may also contain a ribosome binding site for translation initiation and a transcription 
terminator. The vector may also include appropriate sequences for amplifying expression. 

In addition, the expression vectors preferably may contain a gene to provide a phenotypic trait for selection 
of transformed host cells such as dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or such 

25 as tetracycline or ampiciilin resistance in £ coll. 

An embodiment of the invention is an isolated DNA molecule comprising the nucleotide sequence of SEQ ID 
NOs: 2, 3 or 5. This nucleotide sequence, or fragments or functional equivalents thereof, may be used to generate 
recombinant DNA molecules that direct the expression of the polypeptides of the present invention, or functionally 
active peptides or functional equivalents thereof, in appropriate host cells. Due to the degeneracy of the nucleotide 

30 coding sequence, other DNA sequences which encode substantially the same amino acid sequences as depicted in SEQ 
ID NOS: 1, 4, 6, 7, 8 or 9, or analogs or fragments thereof, may be used in the practice of the invention for the cloning 
and expression of a mediator of signal transduction. Such alterations include deletions, additions or substitutions of 
different nucleotide residues resulting in a sequence that encodes the same or a functionally equivalent gene product. 
The gene product may contain deletions, additions or substitutions of amino acid residues within the sequence, which 
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result in a silent change thus producing a bioactive product. Such amino acid substitutions may be made on the basis 
of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, the amphipathic nature of the residues 
involved and/or on the basis of crystallographic data. For example, negatively charged amino acids include aspartic 
acid and glutamic acid; positively charged amino acids include lysine and arginine; amino acids with uncharged polar 
5 head groups having similar hydrophilicity values include the following: leucine, isoleucine, valine; glycine, alanine; 
asparagine, glutamine; serine, threonine; phenylalanine, tyrosine. 

Techniques well known to those skilled in the art for the isolation of DNA, generation of appropriate 
restriction fragments, construction of clones and libraries, and screening recombinants may be used. For a review of 
such techniques, see, for example, Sambrook, et al. t supra, the disclosure of which is hereby incorporated by 

10 reference. Also, the 5' untranslated and coding regions of the nucleotide sequence could be altered to improve the 
translationai efficiency of the mRNA. In addition, based on X-ray crystallographic data, sequence alterations could be 
undertaken to improve protein stability, e.g., introducing disulfide bridges at the appropriate positions, and/or deleting 
or replacing amino acids that are predicted to cause protein instability. These are only examples of modifications that 
can be engineered to produce a more active or stable protein, more protein, or even change the substrate specificity of 

1 5 the protein. 

Example 4 (Cells transformed with recombinant vectors containing DNA encoding polypeptides which can 
mediate signal transduction) 

In a further embodiment, the present invention relates to host cells containing the above-described construct. 
20 The host cell can be a higher eukaryotic cell, such as a mammalian cell, or a lower eukaryotic ceil, such as a yeast cell, 
or the host cell can be a prokaryotic cell, such as a bacterial cell. The host cell preferably may secrete the 
recombinant protein. Introduction of the construct into the host cell can be effected by calcium phosphate 
transfection, DEAEDextran mediated transfection, or electroporation |L. Davis et aL, Basic Methods in Molecular 
Biology, 1986)). 

25 The constructs in host cells can be used in a conventional manner to produce the gene product encoded by 

the recombinant sequence. Alternatively, the polypeptides of the invention can be synthetically produced by 
conventional peptide synthesizers. Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other 
cells under the control of appropriate promoters. Cell-free translation systems can also be employed to produce such 
proteins using RNAs derived from the DNA constructs of the present invention. Appropriate cloning and expression 

30 vectors for use with prokaryotic and eukaryotic hosts are described by Sambrook, et aL, supra. 

Transcription of a DNA encoding the polypeptides of the present invention by higher eukaryotes may be 
increased by inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually from 
about 10 to 300 bp, that act on a promoter to increase its transcription. Examples include the SV40 enhancer on the 
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late side of the replication origin (base pair 100 to 270), a cytomegalovirus early promoter enhancer, a polyoma 
enhancer on the late side of the replication origin, and adenovirus enhancers. 

Generally, recombinant expression vectors will include origins of replication and selectable markers 
permitting transformation of the host cell, e.g., the ampicillin resistance gene of £ coli and S. cerevisiae TRP1 gene, 
5 and a promoter derived from a highly-expressed gene to direct transcription of a downstream structural sequence. 
Such promoters can be derived from operons encoding glycolytic enzymes such as 3-phosphoglycerate kinase (P6K), 
alpha factor, acid phosphatase, or heat shock proteins, among others. The heterologous structural sequence is 
preferably assembled in appropriate phase with translation, initiation and termination sequences, and preferably, a 
leader sequence capable of directing secretion of translated protein into the periplasmic space or extracellular medium. 

10 Optionally, the heterologous sequence can encode a fusion protein including an N-terminal identification peptide 
imparting desired characteristics, e.g., stabilization or simplified purification of expressed recombinant product. 

Useful expression vectors for bacterial use may be constructed by inserting a structural DNA sequence 
encoding a desired protein together with suitable translation, initiation and termination signals in operable reading 
phase with a functional promoter. The vector may comprise one or more phenotypic selectable markers and an origin 

1 5 of replication to ensure maintenance of the vector and to, if desirable, provide amplification within the host. Suitable 
prokaryotic hosts for transformation include £ coli, Bacillus subtilis, Salmonella typhimurium and various species 
within the genera Pseudomonas, Streptomyces, and Staphylococcus, although others may also be employed as a 
matter of choice. 

As a representative but nonlimiting example, useful expression vectors for bacterial use can comprise a 
20 selectable marker and bacterial origin of replication derived from commercially available plasmids comprising genetic 
elements of the well known cloning vector pBR322 (ATCC 37017). Such commercial vectors include, for example, 
pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM1 (Promega Biotec, Madison, Wl). These pBR322 
backbone sections are combined with an appropriate promoter and the structural sequence to be expressed. 

Following transformation of a suitable host strain and growth of the host strain to an appropriate cell 
25 density, the selected promoter may be de-repressed by appropriate means (e.g., temperature shift or chemical 
induction) and cells may be cultured for an additional period. Cells are typically harvested by centrifugation, disrupted 
by physical or chemical means, and the resulting crude extract retained for further purification. 

Microbial cells employed in expression of proteins can be disrupted by any convenient method, including 
freezethaw cycling, sonication, mechanical disruption, or use of cell lysing agents. Various mammalian cell culture 
30 systems can also be employed to express recombinant polypeptides. Examples of mammalian expression systems 
include the COS-7 lines of monkey kidney fibroblasts, described by Gluzman, Cell, 23:175 (1981), and other cell lines 
capable of expressing a compatible vector, for example, the C127, 3T3, CHO, HeLa and BHK cell lines. Mammalian 
expression vectors may comprise an origin of replication, a suitable promoter and enhancer, and also any necessary 
ribosome binding sites, polyadenylation sites, splice donor and acceptor sites, transcriptional termination sequences, 
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and 5' flanking nontranscribed sequences. DNA sequences derived from the SV40 viral genome, for example, SV40 
origin, early promoter, enhancer, splice, and polyadenylation sites may be used to provide the required nontranscribed 
genetic elements. 

The polypeptides of the present invention may be recovered and purified from recombinant cell cultures by 
5 methods used heretofore, including ammonium sulfate or ethanol precipitation, acid extraction, anion or cation 
exchange chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxyapatite chromatography and lectin chromatography. Protein refolding steps can be used, as 
necessary, in completing configuration of the mature protein. Finally, high performance liquid chromatography (HPLC) 
can be employed for final purification steps. 

1 0 The polypeptides of the present invention may be a naturally purified product, or a product of chemical 

synthetic-procedures, or produced by recombinant techniques from a prokaryotic or eukaryotic host (for example, by 
bacterial, yeast, higher plant, insect and mammalian cells in culture). Depending upon the host employed in a 
recombinant production procedure, the polypeptides of the present invention may be glycosylated with mammalian or 
other eukaryotic carbohydrates or may be non-glycosylated. Polypeptides of the invention may also include an initial 

1 5 methionine amino acid residue. 

In particular, two baculovirus expression constructs of a wild-type and a mutant H19G5 C-terminal kinase 
domain have been constructed. The wild-type kinase domain construct was made by cloning a cDNA fragment which 
encodes the amino acid residues 1002 to 1314 of the H19G5 contig protein sequence into the pFastBac HTc vector. 
The mutant kinase domain construct contains the same amino acid sequence as the wild-type kinase domain except 

20 that Tyr residue at position 1213 was changed to a Glu in an attempt to create a constitutively active kinase. Both 
recombinant proteins contain a his-tag at the N-terminus. 

Example 5 (Pharmaceuticalfy acceptable salts of polypeptides which can mediate signal transduction) 

Any peptide of the present invention may be used in the form of a pharmaceutical^ acceptable salt. 

25 Suitable acids which are capable of forming salts with the peptides of the present invention include inorganic acids 
such as hydrochloric acid, hydrobromic acid, perchloric acid, nitric acid, thiocyanic acid, sulfuric acid, phosphoric acid 
and the like; and organic acids such as formic acid, acetic acid, propionic acid, glycolic acid, lactic acid, pyruvic acid, 
oxalic acid, malonic acid, succinic acid, maleic acid, fumaric acid, anthranilic acid, cinnamic acid, naphthalene sulfonic 
acid, sulfanilic acid, and the like. 

30 Suitable bases capable of forming salts with the peptides of the present invention include inorganic bases 

such as sodium hydroxide, ammonium hydroxide, potassium hydroxide and the like; and organic bases such as mono-, 
di- and tri-alkyl and aryl amines (e.g., triethylamine, diisopropyl amine, methyl amine, dimethyl amine and the like) and 
optionally substituted ethanolamines (e.g., ethanolamine, diethanolamine and the like). 
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Example 6 (Pharmaceutical compositions containing polypeptides which can mediate signal transduction) 
For use in a method of identification, prevention or treatment such as the identification, prevention or 
treatment of infection of a mammalian host by a microorganism, the polypeptides of the present invention may be 
present in a pharmaceutical composition in admixture with a pharmaceutical^ acceptable sterile vehicle. The 
5 pharmaceutical composition may be compounded according to conventional pharmaceutical formulation techniques. 

The vehicle may take a wide variety of forms depending on the form of preparation desired for 
administration, e.g., sublingual, rectal, nasal, oral or parenteral. Compositions for oral dosage form may include any of 
the usual pharmaceutical media, such as, for example, water, oils, alcohols, flavoring agents, preservatives, coloring 
agents and the like in the case of oral liquid preparations [e.g., suspensions, elixirs and solutions) or carriers such as 
10 starches, sugars, diluents, granulating agents, lubricants, binders, disintegrating agents and the like in the case of oral 
solid preparations [e.g., powders, capsules and tablets). Controlled release forms may also be used. Because of their 
ease in administration, tablets and capsules represent an advantageous oral dosage unit form, in which case solid 
pharmaceutical carriers may be employed. If desired, tablets may be sugar coated or enteric coated by standard 
techniques. 

1 5 For compositions to be administered parenteral^, the carrier will usually comprise sterile water, although 

other ingredients to aid solubility or for preservation purposes may be included. Injectable suspensions may also be 
prepared, in which case appropriate liquid carriers, suspending agents and the like may be employed. The parenteral 
routes of administration may be intravenous injection, intramuscular injection or subcutaneous injection. 

For intravenous administration, the polypeptides may be dissolved in an appropriate intravenous delivery 

20 vehicle containing physiologically compatible substances such as sodium chloride, glycine and the like, having a 
buffered pH compatible with physiologic conditions. Such intravenous delivery vehicles are known to those skilled in 
the art. 

The polypeptides of the invention may be administered to subjects where mediation of signal transduction is 
desired. The peptides may be administered by any convenient means that will result in the delivery to the subject of 
25 an effective amount to mediate signal transduction. Oral administration is presently contemplated as a preferred 
administration route. The amount administered will depend on the activity of the particular compound administered, 
which may readily be determined by those of ordinary skill in the art. 

Example 7 (Monoclonal antibodies against polypeptides which can mediate signal transduction) 
30 Another embodiment of the present invention relates to a monoclonal antibody to the polypeptides of the 

present invention (or an antigenic portion thereof), which may be produced by methods recognized in the art, including 
the formation of monoclonal antibody-producing hybridomas (Kohler, G., and C. Milstein, Nature 256:495497 (1975); 
Eur. J. Immunol. 6:511-519 (1976)). By fusing antibody-forming cells (spleen lymphocytes) with myeloma cells 
(malignant cells of bone marrow primary tumors), a hybrid cell line is created from a single fused cell hybrid (called a 
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hybridoma or clone) having certain inherited characteristics of both the lymphocytes and myeloma cell lines. Like the 
lymphocytes (taken from animals primed with sheep red blood cells as antigen), the hybridomas secreted a single type 
of immunoglobulin specific to the antigen; moreover, like the myeloma cells, the hybrid cells had the potential for 
indefinite cell division. The combination of these two features offered distinct advantages over conventional antisera. 
5 Whereas antisera derived from vaccinated animals are variable mixtures of polyclonal antibodies which never can be 
reproduced identically, monoclonal antibodies are highly specific immunoglobulins of a single type. The single type of 
immunoglobulin secreted by a hybridoma is specific to one and only one antigenic determinant, or epitope, on the 
antigen, a complex molecule having a multiplicity of antigenic determinants. For instance, if the antigen is a protein, 
an antigenic determinant may be one of the many peptide sequences (generally 6-7 amino acids in length (Atassi, M.Z., 
10 Molec. Cell. Biochem. 32:2143 (1980)) within the entire protein molecule. Hence, monoclonal antibodies raised 
against a single antigen may be distinct from each other depending on the determinant that induced their formation; 
but for any given clone, all of the antibodies it produces are identical. Furthermore, the hybridoma cell line can be 
reproduced indefinitely, is easily propagated in vitro or in vivo, and yields monoclonal antibodies in extremely high 
concentration. 

15 

Example 8 (Therapeutic monoclonal antibodies against polypeptides which can mediate signal transduction) 
The monoclonal antibodies of the present invention can have potential immunotherapeutic value (Oldham, 
R.K., 1 Clin. Oncol., 1:582-590 (1983); Miller, R.A. et aL, Blood, 62:988-995 (1983); Miller R.A. et aL, New Engl. J. 
Med. 306:517-522 (1982); Ritz, J. and Schlossman, S., Blood, 59:1-11 (1982); and Kirch, M.E. and Ulrich, H., J. 

20 Immunol. 127:805-810 (1981) (investigating the therapeutic efficacy in both animal and human subjects)). In 
addition, the monoclonal antibodies can be used in cytotoxic drug-antibody conjugates similar to those described in 
Beverly, P.C.L., Nature, 297:358-9 (1982); Krolick, K.A. et aL, Nature, 295:604-5 (1982); Krolick, K.A. et aL, Proc. 
Natl. Acad. ScL U.S.A., 77:5419-23 (1980); Arnon, R. and Sela, M., Immunol. Rev., 62:5-27 (1982); Raso, V. et aL, 
Cancer Res., 42:457-64 (1982); and DeWeger, R.A. and Dullens, H.F.J., Immunol. Rev. 62:29-45 (1982). 

25 In an embodiment of the invention, purified polypeptides of the present invention (or an antigenic portion 

thereof) can be used as an antigen or immunogen. In addition, microorganisms expressing H19G5 protein or 
polypeptide fragments thereof also represent potential antigens or sources of antigen with which to immunize animals 
to obtain somatic cells for fusion. Somatic cells with the potential for producing antibody and, in particular, B 
lymphocytes, are suitable for fusion with a B-cell myeloma line. Those antibody-producing cells that are in the dividing 

30 plasmablast stage fuse preferentially. Somatic cells may be derived from the lymph nodes, spleens and peripheral 
blood of primed animals and the lymphatic cells of choice depending to a large extent on their empirical usefulness in 
the particular fusion system. Once-primed or hyperimmunized animals can be used as a source of antibody-producing 
lymphocytes. Mouse lymphocytes give a higher percentage of stable fusions with mouse myeloma lines. However, 
the use of rat, rabbit, and frog cells is also possible. Alternatively, human somatic cells capable of producing antibody, 
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specif ically B lymphocytes, are suitable for fusion with myeloma cell lines. While B lymphocytes from biopsied spleens 
or lymph nodes of individual may be used, the more easily accessible peripheral blood B lymphocytes are preferred. 
The lymphocytes may be derived from patients with diagnosed carcinomas. 

Specialized myeloma cell lines have been developed from lymphocyte tumors for use in hybridomaproducing 
5 fusion procedures (Kohler, G., and C. Milstein, Eur. J. Immunol 6:511-519 (1976); M. Schulman et ai, Nature 276: 
269-270 (1978)). Examples of myeloma cell lines that may be used for the production of fused cell hybrids include 
X63-Ag8, NSI-Ag4/1, MPCII45.6TGI.7, C63-Ag8.653, Sp2/0Ag14, FO, and S194/5XX0.BU.1 f all derived from mice; 
210.RCY3.Agl.2.3 f U-226AR, and GM1500GTGAL2, all derived from rats; and U-226AR and GM150QGTGAL2, 
derived from humans, (G.J. Hammerling, U. Hammerling, and J.F. Kearney (editors), Monoclonal Antibodies and Tce/f 
10 Hybridomas in: J.L Turk (editor) Research Monographs in Immunology, Vol. 3, Elsevier/North Holland Biomedical 
Press, NY (1981)). 

Methods for generating hybrids of antibody-producing spleen or lymph node ceils and myeloma cells usually 
comprise mixing somatic cells with myeloma cells in a 2:1 proportion (though the proportion may vary from about 20:1 
to about 1:1), respectively, in the presence of an agent or agents (chemical or electrical) that promote the fusion of 

1 5 cell membranes. It is often preferred that the same species of animal serve as the source of the somatic and myeloma 
cells used in the fusion procedure. Fusion methods have been described by Kohler and Milstein [Nature 256:495497 
(1975) and' At. J. Immunol. 6:511-519 (1976), and by Gefter et ai [Somatic Cell Genet. 3:231-236 (1977)). The 
fusion-promotion agents used by those investigators were Sendai virus and polyethylene glycol (PEG), respectively. 

Once trie desired fused cell hybrids have been selected and cloned into individual antibody-producing cell 

20 lines, each cell line may be propagated in either of two standard ways. A sample of the hybridoma can be injected 
into a histocompatible animal of the type that was used to provide the somatic and myeloma cells for the original 
fusion. The injected animal develops tumors secreting the specific monoclonal antibody produced by the fused cell 
hybrid. The body fluids of the animal, such as serum or ascites fluid, can be tapped to provide monoclonal antibodies 
in high concentration. Alternatively, the individual cell lines may be propagated in vitro in laboratory culture vessels; 

25 the culture medium, also containing high concentrations of a single specific monoclonal antibody, can be harvested by 
decantation, filtration or centrifugation. 

Example 9 (Diagnostic monoclonal antibodies against polypeptides which can mediate signal transduction) 
The monoclonal antibodies of this invention can be used as probes in detecting discrete antigens expressed 
30 by tissue or cell samples. The expression of lack of expression of these antigens can provide clinically exploitable 
information that is not apparent after standard histopathological evaluations. It may thus be possible to correlate the 
immuno-phenotypes of individual tissue or cell samples with various aspects of signal transduction and responsiveness 
to certain types of therapies, thereby establishing important classifications of prognosis. 
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The use of the monoclonal antibodies described herein can be extended to the screening of human biological 
fluids for the presence of the specific antigenic determinant recognized. In vitro immunoserological evaluation of sera 
withdrawn from patients thereby permits non-invasive diagnosis of disease. By way of illustration, human fluids, such 
as pleural fluids or lymph, can be taken from a patient and assayed for the specific epitope, either as released antigen 

5 or membrane-bound on cells in the sample fluid, using monoclonal antibodies against the polypeptides of the present 
invention in standard radioimmunoassays or enzyme-linked immunoassays known in the art or competitive binding 
enzyme-linked immunoassays. 

The monoclonal antibodies of this invention are potentially useful for targeting diseased tissue or ceils in 
vivo. They can therefore be used in humans for localization and monitoring of the microbial infection. For this 

10 application, it is preferable to use purified monoclonal antibodies. Purification of monoclonal antibodies for human 
administration by ammonium sulfate or sodium sulfate precipitation followed by dialysis against saline and filtration 
sterilization has been described by Miller et al (in: Hybridomas in Cancer Diagnosis and Therapy, (1 982), p. 1 34). 

Alternatively, immunoaffinity chromatography techniques may be used to purify the monoclonal antibodies. 
The purified monoclonal antibodies can be labeled with radioactive compounds, for instance, radioactive iodine, and 

1 5 administered to a patient intravenously. After localization of the antibodies at the infection site, they can be detected 
by emission tomographical and radionuclear scanning techniques, thereby pinpointing the location of the infection. 
Experimental radioimmunodetection with monoclonal antibodies may occur by external scintigraphy. 

Passive monoclonal serotherapy may be a potential use for the monoclonal antibodies of this invention. By 
way of illustration, purified anti-H19G5 monoclonal antibody is dissolved in an appropriate carrier, e.g., saline, with or 

20 without human albumin, at an appropriate dosage and is administered to a patient. The monoclonal antibodies are 
preferably administered intravenously, e.g., by continuous intravenous infusion over several hours, as in Miller et al, 
supra. Infusions can be administered over a period of weeks during which the anti microbial effects are monitored. 

Example 10 {AntNdiotypic antibodies to antibodies against polypeptides which can mediate signal 
25 transduction! 

In an alternate embodiment, the antibodies described herein are used to stimulate the production of 
corresponding antiidiotypic antibodies. In brief, antiidiotype antibodies, or antiidiotypes are antibodies directed 
against the antigen combining region or variable region (idiotype) of another antibody. Based on Jerne's network 
model of idiotypic relationships (Jerne, Ann. Immunol 125:373 (1974); Jerne et al, EMBO 1:234 (1982)), 
30 immunization with an antibody molecule expressing a paratope (antigen-combining site) for a given antigen should 
produce a group of anti-antibodies, some of which share with the antigen a complementary structure to the paratope. 
Immunization with a subpopulation of antiidiotypic antibodies should in turn produce a subpopulation of antiidiotypic 
antibodies which bind the initial antigen. Thus, the administration of the monoclonal antibodies of the present 
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invention may result in a modification of the host's immune response, as the consequence of the formation of ami- 
idiotypic antibodies which may develop during therapy with the monoclonals. 

Example 1 1 (Monoclonal antibody-drug conjugates) 
5 The monoclonal antibodies of this invention can be used in conjunction with a broad spectrum of 

pharmaceutical or cytotoxic agents that selectively affect diseased tissue or cells over normal tissues or cells in the 
mammalian host. The methods used for binding the cytotoxic agents to the monoclonal antibody molecule can involve 
either non-covalent or covalent linkages. Since non-covalent bonds are more likely to be broken before the antibody 
complex reaches the target site, covalent linkages are preferred. For instance, carbodiimide can be used to link 

10 carboxy groups of the pharmaceutical agent to amino groups of the antibody molecule. Bifunctionai agents such as 
dialdehydes or imidoesters can be used to link the amino group of a drug to amino groups of the antibody molecule. 
The Schiff base reaction can be used to link drugs to antibody molecules. This method involves the periodate 
oxidation of a drug or cytotoxic agent that contains a glycol or hydroxy group, thus forming an aldehyde that is then 
reacted with the antibody molecule. Attachment occurs via formation of a Schiff base with amino groups of the 

1 5 antibody molecule. Additionally, drugs with reactive sulf hydryl groups have been coupled to antibody molecules. 

Example 12 (Diagnostic kit) 

Another embodiment of the invention relates to a diagnostic kit for detecting diseased tissue or cells using 
an antibody against a polypeptide which can mediate signal transduction. The diagnostic kit may further comprise, 

20 where necessary, other components of the signal producing system, including agents for reducing background 
interference, control reagents, or an apparatus, container or other solid support for conducting the test. The binding 
of antibody to the target can be detected by well known methods, including radiation [e.g., use of a radioactive 
nucleotide), colorimetry [e.g., use of an enzyme that can cause a color change in a substrate), fluorescence [e.g., use 
of a dye such as propidium iodide, fluorescein, or rhodamine), and luminescence [e.g., use of an alkaline phosphatase 

25 substrate that releases photons upon cleavage or luciferin). Detection can be qualitative or quantitative. 

Example 13 (Gene therapy) 

Another embodiment of the present invention involves the use of the DNA of the present invention in gene 
therapy applications. Gene therapy has been broadly defined as "the correction of a disease phenotype through the 
30 introduction of new genetic information into the affected organism" (Roemer, K. and Friedmann, T., Eur. J. Biochem. 
208: 21 1-225 (1992)). Two basic approaches to gene therapy have evolved: (1) ex vivo gene therapy and (2) in vivo 
gene therapy. In ex vivo gene therapy, cells are removed from a subject and cultured in vitro. A functional 
replacement gene is introduced into the cells (transfection) in vitro, the modified cells are expanded in culture, and 
then reimplanted in the subject. These genetically modified, reimplanted ceils are reported to secrete detectable levels 
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of the transacted gene product in situ (Miller, A.D., Blood 76: 271-278 (1990)) and Selden, R.F., et aL, New Eng. J. 
Med. 317: 1067-1076 (1987)). The development of improved retroviral gene transfer methods (transduction) 
facilitates the transfer into and subsequent expression of genetic material by somatic cells (Cepko, C.L, et aL, Ceil 
37: 1053-1062 (1984)). Accordingly, retrovirus-mediated gene transfer has been used in clinical trials to mark 

5 autologous cells and as a way of treating genetic disease (Rosenberg, S.A., et at., New Eng. J. Med. 323: 570-578 
(1990); Anderson, W.F., Human Gene Therapy 2: 99-100 (1991)). Several ex vivo gene therapy studies in humans are 
reported (reviewed in Anderson, W.F., Science 256: 808-813 (1992) and Miller A.D., Nature 357: 455460 (1992)). 

In in vivo gene therapy, target cells are not removed from the subject. Rather, the transferred gene is 
introduced into cells of the recipient organism in situ, that is, within the recipient. In vivo gene therapy has been 

10 examined in several animal models (reviewed in Feigner, PI. and Rhodes, G., Nature 349: 351-352 (1991)). 
Publications have reported the feasibility of direct gene transfer in situ into organs and tissues such as muscle (Ferry, 
N., etat., Proc. Natl. Acad. Sci. 88: 8377-8781 (1991); Quantin, G., etal., Proc. Natl. Acad. Sci. USA 89: 2581-2584 
(1992)), hematopoietic stem cells (Clapp, D.W., etal., Blood 78: 1132-1139 (1991)1, the arterial wall (Nabel, E.G., et 
al. t Science 244: 1342-1344 (1989)), the nervous system (Price, J.D., et aL, Proc. Natl. Acad. Sci 84: 156-160 

15 (1987)), and lung (Rosenfeld, M.A., etal, Science 252: 431434 (1991)). Direct injection of DNA into skeletal muscle 
(Wolff, J.A., etal, Science 247: 1465-1468 (1990)), heart muscle (Kitsis, R.N., etal., Proc. Natl. Acad. Sci. USA 88: 
41384142 (1991)) and injection of DNA-lipid complexes into the vasculature (Lim, C.S., et aL, Circulation 83: 2007- 
2011 (1991); Ledere, G.D., et at., J. Clin. Invest. 90: 936-944 (1992); Chapman, G.D., et at., Clrc. Res. 71: 27-33 
(1992)) also have been reported to yield a detectable expression level of the inserted gene product(s) in vivo. 

20 Recent gene therapy efforts have been aimed at the identification of various cell types for transformation, 

including keratinocytes (Morgan, J.R., etal., Science 237: 1476-1479 (1987)), fibroblasts (Palmer, T.D., etal., Proc. 
Natl. Acad. Sci. 88: 1330-1334 (1991); Garver Jr., R.I., et aL, Science 237: 762-764 (1987); International Patent 
Application PCT/US92/01890, having publication number W0 92/15676), lymphocytes (Reimann, J.K., et aL, J. 
Immunol. Methods 89: 93*101 (1986)), myoblasts (Barr, E. and Leiden, J.M., Science 254: 1507-1509 (1991); Dai, Y. 

25 etal., PNAS 89: 10892-10895 (1992); Roman, M., etal., Somatic Cell and Molecular Genetics 18: 247-258 (1992)), 
smooth muscle cells (Lynch, CM. etal.. Proc. Natl. Acad. Sci. USA 89: 1138-1142 (1992)), and epithelial cells (Nabel, 
E.G., etal., Science 244: 1342-1344 (1989)), International Patent Application PCT/US89/05575 (having publication 
number W0 90/06997), the contents of which references and patent/patent applications are incorporated herein by 
reference. 

30 The delivery of an effective dose of a prophylactic or therapeutic agent in situ depends on the efficiency of 

transfection (or transduction) as well as the number of target cells. Epithelial cell-based gene therapy, in particular, 
involves a relatively small area available in situ for receiving genetically modified epithelial cells. The delivery of an 
effective dose of prophylactic or therapeutic agent in situ thus depends upon the total number of implanted epithelial 
cells. 
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In one embodiment of the invention, exogenous genetic materia! [e.g., a cDNA encoding a polypeptide of the 
present invention) is introduced into a syngeneic host cell ex vivo or in vivo by genetic transfer methods, such as 
transfection or transduction, to provide a genetically modified host cell. Various expression vectors (A*., vehicles for 
facilitating delivery of exogenous genetic material into a target cell) are known to one skilled in the art. 

5 Transfection refers to the insertion of nucleic acid into a mammalian host cell using physical or chemical 

methods. Several transfection techniques are known to those of ordinary skill in the art including: calcium phosphate 
DNA co-precipitation (METHODS in MOLECULAR Biology, Vol. 7, Gene Transfer and Expression Protocois, Ed. EJ. 
Murray, Humana Press (1991)); DEAE-dextran; electroporation; cationic iiposomemediated transfection; and tungsten 
particle-facilitated microparticle bombardment (Johnston, SA, Nature 346: 776-777 (1990)). Strontium phosphate 

10 DNA co-precipitation (Brash D.E. et at. Moiec. Cell. Biol. 1: 2031-2034 (1987)) is a preferred transfection method. 

In contrast, transduction refers to the process of transferring nucleic acid into a cell using a DNA or RNA 
virus. A RNA virus {i.e., a retrovirus) for transferring a nucleic acid into a cell is referred to herein as a transducing 
chimeric retrovirus. Exogenous genetic material contained within the retrovirus is incorporated into the genome of the 
transduced host cell. A host cell that has been transduced with a chimeric DNA virus [e.g., an adenovirus carrying a 

15 cDNA encoding a therapeutic agent) will not have the exogenous genetic material incorporated into its genome, but 
will be capable of expressing the exogenous genetic material that is retained extrachromosomally within the cell. 

Typically, the exogenous genetic material includes the heterologous gene (usually in the form of a cDNA 
comprising the exons coding for the therapeutic protein) together with a promoter to control transcription of the new 
gene. The promoter characteristically has a specific nucleotide sequence necessary to initiate transcription. 

20 Optionally, the exogenous genetic material further includes additional sequences [i.e., enhancers) required to obtain the 
desired gene transcription activity. For the purpose of this discussion an enhancer is simply any non-translated DNA 
sequence which works contiguous with the coding sequence (in cis) to change the basal transcription level dictated by 
the promoter. Preferably, the exogenous genetic material is introduced into the host cell genome immediately 
downstream from the promoter so that the promoter and coding sequence are operatively linked so as to permit 

25 transcription of the coding sequence. A preferred retroviral expression vector includes an exogenous promoter 
element to control transcription of the inserted exogenous gene. Such exogenous promoters include both constitutive 
and inducible promoters. 

Naturally-occurring constitutive promoters control the expression of essential cell functions. As a result, a 
gene under the control of a constitutive promoter is expressed under all conditions of cell growth. Exemplary 
30 constitutive promoters include the promoters for the following genes which encode certain constitutive or 
housekeeping functions: hypoxanthine phosphoribosyl transferase (HPRT), dihydrofolate reductase (DHFR) 
(Scharfmann etaL, Proc. Natl Acad. Sci. USA 88: 4626-4630 (1991)), adenosine deaminase, phosphoglycerol kinase 
(PGK), pyruvate kinase, phosphoglycerol mutase, the beta -actin promoter (Lai et al, Proc. Natl. Acad. Sci. USA 86: 
. 10006-10010 (1989)), and other constitutive-promoters known to those of skill in the art. In addition, many viral 
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promoters function constitutively in eukaryolic cells. These include: the early and late promoters of SV40, the long 
terminal repeats (LTRs) of Moloney Leukemia Virus and other retroviruses, and the thymidine kinase promoter of 
Herpes Simplex Virus, among many others. Accordingly, any such constitutive promoters can be used to control 
transcription of a heterologous gene insert. 

5 Genes that are under the control of inducible promoters are expressed only or to a greater degree, in the 

presence of an inducing agent, {e.g., transcription under control of the metallothionein promoter is greatly increased in 
presence of certain metal ions}. Inducible promoters include responsive elements (REs) which stimulate transcription 
when their inducing factors are bound. For example, there are REs for serum factors, steroid hormones, retinoic acid 
and cyclic AMP. Promoters containing a particular RE can be chosen in order to obtain an inducible response, and in 

10 some cases, the RE itself may be attached to a different promoter, thereby conferring inducibility to the recombinant 
gene. Thus, by selecting the appropriate promoter (constitutive versus inducible; strong versus weak), it is possible to 
control both the existence and level of expression of a therapeutic agent in the genetically modified host cell. If the 
gene encoding the prophylactic or therapeutic agent is under the control of an inducible promoter, delivery of the agent 
in situ is triggered by exposing the genetically modified cell in situ to conditions for permitting transcription of the 

15 prophylactic or therapeutic agent, e.g., by intraperitoneal injection of specific inducers of the inducible promoters 
which control transcription of the agent. For example, in situ expression by genetically modified host cells of a 
therapeutic agent encoded by a gene under the control of the metallothionein promoter, is enhanced by contacting the 
genetically modified cells with a solution containing the appropriate {i.e., inducing) metal ions in situ. 

Accordingly, the amount of therapeutic agent that is delivered in situ is regulated by controlling such factors 

20 as: (1) the nature of the promoter used to direct transcription of the inserted gene \i.e., whether the promoter is 
constitutive or inducible, strong or weak); (2) the number of copies of the exogenous gene that are inserted into the 
host cell; (3) the number of transduced/transfected host cells that are administered (e.g., implanted) to the patient; (4) 
the size of the implant [e.g., graft or encapsulated expression system); (5) the number of implants; (6) the length of 
time the transduced/transfected cells or implants are left in place; and (7) the production rate of the prophylactic or 

25 therapeutic agent by the genetically modified host cell. Selection and optimization of these factors for delivery of an 
effective dose of a particular prophylactic or therapeutic agent is deemed to be within the scope of one of skill in the 
art, taking into account the above-disclosed factors and the clinical profile of the patient. 

In addition to at least one promoter and at least one heterologous nucleic acid encoding the prophylactic or 
therapeutic agent, the expression vector preferably includes a selection gene, for example, a neomycin resistance 

30 gene, for facilitating selection of host cells that have been transfected or transduced with the expression vector. 
Alternatively, the host cells are transfected with two or more expression vectors, at least one vector containing the 
gene(s) encoding the prophylactic or therapeutic agent(s), the other vector containing a selection gene. The selection 
of a suitable promoter, enhancer, selection gene and/or signal sequence is deemed to be within the scope of one skilled 
in the art. 
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The prophylactic or therapeutic agent can be targeted for delivery to an extracellular, intracellular or 
membrane location. If it is desirable for the gene product to be secreted from the host cells, the expression vector is 
designed to include an appropriate secretion signal sequence for secreting the therapeutic gene product from the cell 
to the extracellular milieu. If it is desirable for the gene product to be retained within the host cell, this secretion 
5 signal sequence is omitted. In a similar manner, the expression vector can be constructed to include retention signal 
sequences for anchoring the prophylactic or therapeutic agent within the host cell plasma membrane. For example, 
membrane proteins have hydrophobic transmembrane regions that stop translocation of the protein in the membrane 
and do not allow the protein to be secreted. The construction of an expression vector including signal sequences for 
targeting a gene product to a particular location is deemed to be within the scope of one of skill in the art. 

10 In an embodiment, vectors for mammalian host cell gene therapy are viruses, more preferably replication- 

deficient viruses (described in detail below). Exemplary viral vectors are derived from: Harvey Sarcoma Virus; Rous 
Sarcoma Virus, MPSV, Moloney murine leukemia virus and DNA viruses [e.g., adenovirus). See Temin, H., Retrovirus 
vectors for gene transfer, in Gene Transfer, Kucherlapati R, Ed., pp. 149-187 (Plenum 1986). 

Replication-deficient retroviruses are capable of directing synthesis of virion proteins, but are incapable of 

1 5 making infectious particles. Accordingly, these genetically altered retroviral expression vectors have general utility for 
high-efficiency transduction of genes in cultured cells, and specific utility for use in the method of the present 
invention. Such retroviruses further have utility for the efficient transduction of genes into host cells in vivo. 
Retroviruses have been used extensively for transferring genetic material into cells. Standard protocols for producing 
replication-deficient retroviruses (including the steps of incorporation of exogenous genetic material into a plasmid, 

20 transfection of a packaging cell line with plasmid, production of recombinant retroviruses by the packaging cell line, 
collection of viral particles from tissue culture media, and infection of the target cells with the viral particles) are 
provided in Kriegler, M. Gene Transfer and Expression, A Laboratory Manual, W.H. Freeman Co., NY (1990) and 
Murray, E.J., ed. Methods in Molecular Biology, Vol. 1, Humana Press Inc., Clifton, NJ (1991). 

The major advantage of using retroviruses for gene therapy is that the viruses insert the gene encoding the 

25 therapeutic agent into the host cell genome, thereby permitting the exogenous genetic material to be passed on to the 
progeny of the cell when it divides. In addition, gene promoter sequences in the LTR region have been reported to 
enhance expression of an inserted coding sequence in a variety of cell types [see e.g., Hilberg et a/., Proc. Natl. Acad. 
ScL USA 84: 5232 5236 (1987); Holland et at., Proc. Natl. Acad. Set. USA 84: 8662-8666 (1987); Valerio et aL, Gene 
84: 419427 (1989)). In vivo gene therapy using replication-deficient retroviral vectors to deliver a therapeutically 

30 effective amount of a therapeutic agent can be efficacious if the efficiency of transduction is high and/or the number 
of target cells available for transduction is high. 

Yet another viral candidate useful as an expression vector for transformation of mammalian host cells is the 
adenovirus, a double-stranded DNA virus. The adenovirus is frequently responsible for respiratory tract infections in 
humans and thus appears to have an avidity for the epithelium of the respiratory tract (Straus, S., The Adenovirus, 
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H.S. Ginsberg, Editor, Plenum Press, NY, p.451-496 (1984)). Moreover, the adenovirus is infective in a wide range of 
cell types, including, for example, muscle and epithelial cells (Larrick, J.W. and Burck, K.L., Gene Therapy. Application 
of Molecular Biology, Elsevier Science Publishing Co., Inc., NY, p.71-104 (1991)). The adenovirus also has been 
used as an expression vector in muscle cells in vivo (Quantin, B., et al, Proc. Natl. Acad. Sci. USA 89: 2581-2584 
5 (1992)). 

Like the retrovirus, the adenovirus genome is adaptable for use as an expression vector for gene therapy, 
i.e., by removing the genetic information that controls production of the virus itself (Rosenfeld, M.A., et al., Science 
252:431-434 (1991)). Because the adenovirus functions in an extrachromosomal fashion, the recombinant adenovirus 
does not have the theoretical problem of insertional mutagenesis. 

1 0 Thus, as will be apparent to one skilled in the art, a variety of suitable viral expression vectors are available 

for transferring exogenous genetic material into mammalian host cells. The selection of an appropriate expression 
vector to express an agent for the identification, prevention or treatment of microbial infection amenable to gene 
replacement therapy and the optimization of the conditions for insertion of the selected expression vector into the cell 
are within the scope of one of skill in the art without the need for undue experimentation. 

15 In an alternative embodiment, the expression vector is in the form of a plasmid, which is transferred into the 

target host cells by one of a variety of methods: physical [e.g., microinjection (Capecchi, M.R., Cell 22: 479-488 
(1980)), electroporation (Andreason, G.L and Evans, G.A. Biotechniques 6: 650-660 (1988)), scrape loading, 
microparticle bombardment (Johnston, S.A., Nature 346: 776-777 (1 990)) or by cellular uptake as a chemical complex 
[e.g., calcium or strontium* co-precipitation, complexation with lipid, complexation with ligand) (METHODS IN MOLECULAR 

20 Biology, Vol. 7, Gene Transfer and Expression Protocols, Ed. E. J. Murray, Humana Press (1991)). Several 
commercial products are available for cationic liposome complexation including Lipofectin (Life Technologies, Inc., 
Gaithersburg, MD) (Feigner, P.L, et al., Proc. Natl. Acad. Sci. 84: 7413-7417 (1987)) and Transfectam™ (ProMega, 
Madison, Wis.) (Behr, J.P., et al, Proc. Natl. Acad. Sci. USA 86: 6982-6986 (1989); Loeffler, J.P., et al., J. 
Neurochem. 54: 1812-1815 (1990)). However, the efficiency of transfection by these methods is highly dependent on 

25 the nature of the target cell and accordingly, the conditions for optimal transfection of nucleic acids into host cells 
using the above-mentioned procedures must be optimized. Such optimization is within the scope of one of skill in the 
art. 

In an embodiment, the preparation of genetically modified host cells contains an amount of cells sufficient to 
deliver a prophylactically or therapeutically effective dose of a signal transduction mediator of the present invention to 
30 the recipient in situ. The determination of an effective dose of the prophylactic or therapeutic agent for a known 
microbial infection is within the scope of one of skill in the art. Thus, in determining the effective dose, the skilled 
artisan would consider the condition of the patient, the severity of the condition, as well as the results of clinical 
studies of the prophylactic or therapeutic agent being administered. 
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If the genetically modified host cells are not already present in a pharmaceutical^ acceptable carrier, they 
are placed in such a carrier prior to administration to the recipient. Such pharmaceutical^ acceptable carriers include, 
for example, isotonic saline and other buffers as appropriate to the patient and therapy. The genetically modified cells 
are administered by, for example, intraperitoneal injecting or implanting the celts or a graft or capsule containing the 
5 ceils in a host cell-compatible site of the recipient. As used herein, host cell-compatible site refers to a structure, 
cavity or fluid of the recipient into which the genetically modified cell(s), host cell graft, or encapsulated host cell 
expression system can be implanted, without triggering adverse physiological consequences. Representative host cell- 
compatible sites include, for example, the peritoneal, pleural and pericardial cavities. Preferably, the host cell- 
compatible site communicates with the lymphatic system, thereby enabling delivery of the therapeutic agent to the 
10 vascular system. 

In one embodiment, the host cell-compatible site may be denuded prior to implanting the cells. Exemplary 
denuding methods include but are not limited to: (1) injection of distilled water into the site [e.g., the peritoneal cavity) 
for 20 minutes, followed by scraping off a portion of the epithelial layer; (2) injection of 0.1% buffered trypsin for 20 
minutes followed by scraping; (3) removal of epithelial cells by gentle scraping with a cell scraper and (4) touching a 

1 5 piece of Gelfilm (Upjohn, Kalamazoo, Ml) to the endothelium. 

The genetically modified host cells are implanted in a host cell-compatible site, alone or in combination with 
other genetically modified host cells. Thus, the instant invention embraces a method for modifying the epithelial 
system of a recipient by using a mixture of genetically modified host cells, such that a first modified cell expresses a 
first prophylactic or therapeutic agent of the present invention and a second modified cell expresses a second 

20 prophylactic or therapeutic agent. Other genetically modified cell types (e.g., hepatocytes, smooth muscle cells, 
fibroblasts, glial ceils, mesothelial cells or keratinocytes) can be added, together with the genetically altered epithelial 
cells, to produce expression of a complex set of introduced genes. Moreover, more than one recombinant gene can be 
introduced into each genetically modified cell on the same or different vectors, thereby allowing the expression of 
multiple prophylactic or therapeutic agents of the present invention by a single cell. 

25 The instant invention further embraces an epithelial cell graft. The graft comprises a plurality of the above- 

described genetically modified cells attached to a support that is suitable for implantation into a mammalian recipient, 
preferably into the oral cavity. The support can be formed of a natural or synthetic material. According to another 
aspect of the invention, an encapsulated host cell expression system is provided. The encapsulated system includes a 
capsule suitable for implantation into a mammalian recipient and a plurality of the above-described genetically 

30 modified host cells contained therein. The capsule can be formed of a synthetic or naturally-occurring material. The 
formulation of such capsules is known to one of ordinary skill in the art. In contrast to the host cells that are directly 
implanted into the mammalian recipient [i.e., implanted in a manner such that the genetically modified cells are in 
direct physical contact with the host cell-compatible site), the encapsulated cells remain isolated (i.e., not in direct 
physical contact with the site) following implantation. Thus, the encapsulated host cell system is not limited to a 
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capsule including genetically-modified non-immortalized host cells, but may contain genetically modified immortalized 
host cells. 

The invention has been disclosed broadly and illustrated in reference to representative embodiments 
described above. Those skilled in the art will recognize that various modifications can be made to the present 
5 invention without departing from the spirit and scope thereof. 
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We claim: 

I. A purified polypeptide comprising an amino acid sequence having at least 95% sequence identity with 
sequences selected from the group consisting of SEQ ID NQS: 1, 4, 6, 7, 8 and 9, wherein said polypeptide is capable 
of regulating signal transduction. 

5 2. The polypeptide of claim 1, wherein said polypeptide is capable of catalyzing the transfer of a phosphate 

group from a donor molecule to an acceptor molecule. 

3. An isolated DNA molecule encoding the polypeptide of claim 1. 

4. An isolated DNA molecule comprising the nucleotide sequence selected from the group consisting of SEQ 
ID NOS: 2, 3 and 5. 

10 5. A vector comprising the DNA molecule of claim 3. 

6. A host cell transformed with the vector of claim 5. 

7. The host cell of claim 6, wherein said host cell produces a polypeptide capable of regulating signal 
transduction. 

8. The host cell of claim 6, wherein said host cell produces a polypeptide capable of catalyzing the transfer 
15 of a phosphate group from a donor molecule to an acceptor molecule. 

9. An isolated antibody against the polypeptide of claim 1. 

10. The antibody of claim 9, wherein said antibody is a monoclonal antibody. 

II. The antibody of claim 9, wherein said antibody is capable of inhibiting the regulation of signal 
transduction. 

20 1 2. The antibody of claim 9, wherein said antibody is capable of inhibiting the transfer of a phosphate group 

from a donor molecule to an acceptor molecule. 

13. An isolated nucleic acid capable of hybridizing under high stringency conditions to the DNA molecule of 

claim 3. 

14. The isolated nucleic acid of claim 13, wherein said nucleic acid is capable of inhibiting the regulation of 
25 signal transduction. 

15. The isolated nucleic acid of claim 13, wherein said nucleic acid is capable of inhibiting said transfer of 
said phosphate group from said donor molecule to said acceptor molecule. 

16. A method of preventing or treating disease in a mammal comprising administering to said mammal an 
effective amount of material, selected from the group consisting of the polypeptide of claim 1 and antibody against 

30 said polypeptide, in a pharmaceutical^ acceptable sterile vehicle. 

17. The method of claim 16, wherein said mammal is a human. 

1 8. The method of claim 1 6, wherein said disease is cardiac disease. 
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19. A vaccine for preventing disease in a mamma! comprising administering to said mammal an effective 
amount of material, selected from the group consisting of the polypeptide of claim 1 and the antibody against said 
polypeptide, in a pharmaceutical^ acceptable sterile vehicle. 

20. The vaccine of claim 19, wherein said mammal is a human. 

5 21. The vaccine of claim 1 9, wherein said disease is cardiac disease. 

22. A method of preventing or treating disease in a mammal comprising administering to said mammal 
syngeneic cells transformed with the vector of claim 5, wherein said transformed syngeneic cells produce a 
polypeptide capable of regulating signal transduction. 

23. The method of claim 22, wherein said mammal is a human. 

1 0 24. The method of claim 22, wherein said disease is cardiac disease. 

25. A method of preventing or treating disease in a mammal comprising administering to said mammal 
syngeneic cells transformed with the vector of claim 5, wherein said transformed syngeneic cells produce a 
polypeptide capable of catalyzing the transfer of a phosphate group from a donor molecule to an acceptor molecule. 

26. The method of claim 25, wherein said mammal is a human. 

1 5 27. The method of claim 25, wherein said disease is cardiac disease. 

28. A kit for detecting the expression of a protein capable of regulating signal transduction, comprising the 
polypeptide of claim 1. 

29. The kit of claim 28, further comprising a detectable label selected from the group consisting of 
colorimetric, enzymatic, chemiluminescent, fluorescent and radioactive labels. 

20 30. A kit for detecting the expression of a protein capable of acting as a donor molecule or an acceptor 

molecule of a phosphate group, comprising the polypeptide of claim 1. 

31. The kit of claim 30, further comprising a detectable label selected from the group consisting of 
colorimetric, enzymatic, chemiluminescent, fluorescent and radioactive labels. 

32. A method for detecting the expression of a protein capable of regulating signal transduction, comprising 
25 contacting a sample with the polypeptide of claim 1, and detecting any effect of the sample on an indicator of signal 

transduction. 

33. The method of claim 32, wherein said polypeptide is immobilized to a solid support. 

34. The method of claim 32, wherein said phosphate group is detectably labeled. 

35. A method for detecting the expression of a protein capable of acting as a donor molecule or an acceptor 
30 molecule of a phosphate group, comprising contacting a sample with the polypeptide of claim 1, and detecting any 

transfer of said phosphate group. 

36. The method of claim 35, wherein said polypeptide is immobilized to a solid support. 

37. The method of claim 35, wherein said phosphate group is detectably labeled. 
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SEQUENCE LISTING 

<110> Wenlin Zeng 

Lawrence Stanton 

<120> Mediators of signal transduction 
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<170> FastSEQ for Windows Version 4.0 
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Glu 


Glu 


Ala 


Arg 


Ala 


Glu 


Ser 


Gin 


Ser 


Glu 


Glu 


Gin 


Gin 


GlU 


Ala 


Arg 




610 








615 




















Ala 


Glu 


Ser 


Pro 


Leu 


Pro 


Gin 


Val 


Ser 


Ala 


Arg 


Pro 


Val 


Pro 


Glu 


val 


625 










630 










635 










a a r\ 
04 U 


Gly Arg 


Ala 


Pro 


Thr 


Arg 


Ser 


Ser 


Pro 


Glu 


Pro 


Thr 


Pro 


Trp 


Glu 


Asp 










645 










650 










boo 




He 


Gly 


Gin 


Val 


Ser 


Leu 


Val 


Gin 


He 


Arg 


Asp 


Leu 


Ser 


Gly 


Asp 


Ala 






660 










665 










670 






Glu 


Ala 


Ala 


Asp 


Thr 


He 


Ser 


Leu 


Asp 


He 


Ser 


Glu 


Val 


Asp 


Pro 


Ala 






675 








680 










685 








Tyr 


Leu 


Asn 


Leu 


Ser 


Asp 


Leu 


Tyr 


Asp 


He 


Lys 


Tyr 


Leu 


Pro 


Phe 


Glu 


690 










695 










700 










Phe 


Met 


lie 


Phe 


Arg 


Lys 


Val 


Pro 


Lys 


Ser 


Ala 


Gin 


Pro 


Glu 


Pro 


Pro 


705 








710 










715 










720 


Ser 


Pro 


Met 


Ala 


Glu 


Glu 


Glu 


Leu 


.Ala 


Glu 


Phe 


Pro 


Glu 


Pro 


Thr 


Trp 
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725 730 735 



Pro 


Trp 


Pro 


Gly 


Glu Leu Gly 


Pro 


His 


Ala 


Gly Leu 


G±U 


lie 


Thr 


blU 








740 








745 








750 






Glu 


Ser 


Glu 


Asp 


Val Asp Ala 


Leu 


Leu 


Ala 


Glu Ala 


ax a 


Val 


Gly 


Arg 






755 






760 








n c c 
/ bb 








Lys Arg 


Lys 


Trp 


Ser 


Ser Pro 


Ser Arg 


Ser 


Leu Phe 


JilS 


Phe 


Pro 






770 








775 








780 










Arg 


His 


Leu 


Pro 


Leu 


Asp Glu 


Pro 


Ala 


Glu 


Leu Gly 


Leu 


Arg 


Glu 


Arg 


785 










790 








795 








oUU 


Val 


Lys 


Ala 


Ser 


Val 


Glu His 


He 


Ser 


Arg 


He Leu 


Lys 


Gly Arg 


Pro 








805 








810 








815 




Glu Gly 


Leu 


Glu 


Lys 


Glu Gly 


Pro 


Pro 


Arg 


Lys Lys 


Pro 


Gly 


Leu 


Ala 








820 








825 








830 






Ser 


Phe 


Arg 


Leu 


Ser 


Gly Leu 


Lys 


Ser 


Trp 


Asp Arg 


Ala 


Pro 


Thr 


Phe 






835 








840 








845 








Leu 


Arg 


Glu 


Leu 


Ser Asp Glu 


Thr 


Val 


Val 


Leu Gly 


Gin 


Ser 


Val 


Thr 




850 








855 








860 










Leu 


Ala 


Cys 


Gin 


Val 


Ser Ala 


Gin 


Pro 


Ala 


Ala Gin 


Ala 


Thr 


Trp 


Ser 


865 








870 








875 








880 


Lys Asp 


Gly Ala 


Pro 


Leu Glu 


Ser 


Ser 


Ser 


Arg Val 


Leu 


He 


Ser 


Ala 










885 








890 












Thr 


Leu 


Lys 


Asn 


Phe 


Gin Leu 


Leu 


Thr 


He 


Leu Val 


Val 


Val 


Ala 


Glu 






900 








905 








910 






Asp 


Leu 


Gly 


Val 


Tyr 


Thr Cys 


Ser 


Val 


Ser 


Asn Ala 


Leu 


Gly 


Thr 


Val 






915 








920 








925 








Thr 


Thr 


Thr 


Gly 


Val 


Leu Arg 


Lys 


Ala 


Glu 


Arg Pro 


Ser 


Ser 


Ser 


Pro 




930 








935 








940 










Cys 


Pro 


Asp 


He 


Gly Glu Val 


Tyr Ala Asp 


Gly Val 


Leu 


Leu 


Val 


Trp 


945 










950 








955 








960 


Lys 


Pro 


Val' Glu 


Ser 


Tyr Gly 


Pro 


Val 


Thr 


Tyr He. 


Val 


Gin 


Cys 


Ser 








965 








970 








975 




Leu 


Glu 


Gly Gly 


Ser 


Trp Thr 


Thr 


Leu 


Ala 


Ser Asp 


He 


Phe 


Asp 


Cys 








980 








985 








990 






Cys 


Tyr 


Leu 


Thr 


Ser 


Lys Leu 


Ser 


Arg 


Gly 


Gly Thr 


Tyr 


Thr 


Phe 


Arg 




995 








1000 






1005 






Thr 


Ala 


Cys 


Val 


Ser 


Lys Ala 


Gly Met 


Gly 


Pro Tyr 


Ser 


Ser 


Pro 


Ser 




1010 






1015 






1020 








Glu 


Gin 


Val 


Leu 


Leu 


Gly Gly 


Pro 


Ser 


His 


Leu Ala 


Ser 


Glu 


Glu 


Glu 


1025 








1030 








1035 








1040 


Ser 


Gin 


Gly Arg 


Ser 


Ala Gin 


Pro 


Leu 


Pro 


Ser Thr 


Lys 


Thr 


Phe 


Ala 



1045 1050 1055 



Phe Gin Thr Gin He Gin Arg Gly Arg Phe Ser Val Val Arg Gin Cys 

1060 1065 1070 

Trp Glu Lys Ala Ser Gly Arg Ala Leu Ala Ala Lys He lie Pro Tyr 

1075 1080 1085 

His Pro Lys Asp Lys Thr Ala Val Leu Arg Glu Tyr Glu Ala Leu Lys 

1090 1095 1100 

Gly Leu Arg His Pro His Leu Ala Gin Leu His Ala Ala Tyr Leu Ser 

1105 " 1110 1115 1120 

Pro Arg His Leu Val Leu lie Leu Glu Leu Cys Ser Gly Pro Glu Leu 

1125 1130 1135 

Leu Pro Cys Leu Ala Glu Arg Ala Ser Tyr Ser Glu Ser Glu Val Lys 

1140 1145 1150 

Asp Tyr Leu Trp Gin Met Leu Ser Ala Thr Gin Tyr Leu His Asn Gin 

1155 1160 1165 

His lie Leu His Leu Asp Leu Arg Ser Glu Asn Met lie lie Thr Glu 
1170 1175 H80 
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Tyr Asn Leu Leu Lys Val Val Asp Leu Gly Asn Ala Gin Ser Leu Ser 
1185 1190 1195 1200 

Gin Glu Lys Val Leu Pro Ser Asp Lys Phe Lys Asp Tyr Leu Glu Thr 

1205 1210 1215 

Met Ala Pro Glu Leu Leu Glu Gly Gin Gly Ala Val Pro Gin Thr Asp 

1220 1225 1230 

He Trp Ala He Gly Val Thr Ala Phe He Met Leu Ser Ala Glu Tyr 

1235 1240 1245 

Pro Val Ser Ser Glu Gly Ala Arg Asp Leu Gin Arg Gly Leu Arg Lys 

1250 1255 1260 

Gly Leu Val Arg Leu Ser Arg Cys Tyr Ala Gly Leu Ser Gly Gly Ala 
1265 1270 1275 1280 

Val Ala Phe Leu Arg Ser Thr Leu Cys Ala Gin Pro Trp Gly Arg Pro 

1285 1290 1295 

Cys Ala Ser Ser Cys Leu Gin Cys Pro Trp Leu Thr Glu Glu Gly Pro 

1300 1305 1310 

Ala Cys Ser Arg Pro Ala Pro Val Thr Phe Pro Thr Ala Arg Leu Arg 

1315 1320 1325 

Val Phe Val Arg Asn Arg Glu Lys Arg Arg Ala Leu Leu Tyr Lys Arg 

1330 1335 1340 

His Asn Leu Ala Gin Val Arg 
1345 1350 



<210> 2 

<211> 4175 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (4053) 



<400> 2 



gga 


gtg 


cag 


ttc 


age. 


cag 


tac 


ggc 


tec 


ate 


ate 


cag 


cag 


aac 


cct 


gtg 


age 


gaa 


ggt 


gtc 


ate 


tec 


tac 


etc 


age 


ctg 


acc 


gag 


agt 


gac 


cgt 


gee 


acc 


etc 


ctg 


aac 


tgg 


age 


age 


ccc 


atg 


get 


gec 


cac 


etc 


ate 


aag 


get 


acg 


ctg 


cag 


aga 


gee 


cct 


cag 


tgc 


etc 


tec 


cac 


ccc 


tgg 


ttc 


ctg 


gec 


cac 


ttc 


ate 


aac 


acc 


aag 


cag 


etc 


cgc 


tgg 


cag 


cgt 


tec 


ctg 


atg 


age 


tac 


tec 


ate 


cct 


gag 


ctg 


ctg 


egg 


ggc 


cca 


gta 


gec 


egg 


cac 


etc 


tgc 


agg 


gac 


act 


tec 


tec 


tct 


gac 


aac 


gag 


etc 


gee 


cca 


cca 


ccc 


tec 


ccg 


gtg 


aca 


cac 


tea 


cca 


ctg 


egg 


ccc 


teg 


gee 


age 


ctg 


cct 


gag 


tec 


acc 


gag 


gec 


cca 


get 


ccg 


cct 


gca 


ccg 


gee 


gee 


cag 


ggc 


tgc 


gtg 


ccc 


egg 


ttc 


tac 


cac 


cag 


gcg 


ggt 


gag 


age 


cct 


ggg 


age 


agg 


egg 


cac 


ccg 


gec 


egg 


egg 


tac 


att 


gcg 


ggg 


gcg 


ctg 


cca 


ggc 


ctg 


cgc 


gtg 


ctg 


gag 


gag 


gag 


gee 


gee 


.agg 



cct 


gag 


ttc 


gtc 


tec 


ccc 


gag 


48 


gee 


tec 


gac 


att 


tgg 


gee 


atg 


96 


tgc 


tea 


tec 


cca 


ttt 


gee 


ggc 


144 


gtc 


ctg 


gag 


ggg 


cgc 


gtg 


tea 


192 


age 


gaa 


gac 


gee 


aaa 


gac 


ttc 


240 


cag 


gee 


egg 


cct 


agt 


gcg 


gee 


288 


aaa 


tec 


atg 


cct 


gcg 


gag 


gag 


336 


aag 


ttc 


etc 


ctg 


gee 


cga 


agt 


384 


aag 


tec 


ate 


ctg 


gtg 


atg 


cgc 


432 


ccc 


gac 


age 


ccc 


tec 


etc 


ggc 


480 


ggt 


ggc 


tec 


tec 


agt 


tec 


tec 


528 


ttt 


gec 


egg 


get 


aag 


tea 


ctg 


576 


ctg 


ctg 


cac 


ccc 


egg 


ggc 


ttc 


624 


gaa 


gec 


gag 


gec 


agt 


gag 


cgc 


672 


tct 


ccc 


gag 


ggt 


gec 


ggg 


cca 


720 


cac 


age 


gtc 


ate 


cgc 


age 


ctg 


768 


gag 


cac 


ggg 


gec 


ctg 


gec 


ccg 


816 


egg 


cac 


ctg 


ctg 


aag 


ggc 


ggg 


864 


cgc 


gag 


cca 


ctg 


atg 


gag 


cac 


912 


gag 


gag 


cag 


gec 


acc 


etc 


ctg 


96C 
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CfCC 


aaa 


gec 


c* C* (~> 

tit 


ltd 


ttc 


nan 


art 


nor* 
ycc 


ct c 


egg 


rt n 
1 1 y 


rrt 

1 1 1 


gee 


tct 


aac 
yyt 


1008 


a cc 


cac 


1 1 n 

ttg 


gec 


tO L 


nnn 

ggc 


cac 


age 


cac 


tee 


ctg 


gaa 


nat 
t d l 


gac 


tct 


ccg 


1056 


agC 


ace 


ccc 


cgc 


CCC 




teg 


gag 


gee 


tgc 


ggt 


gag 


gca 


cag 


cga 


eta 
t ly 


1104 


cc t 


tea 


nnn 
y 1 1 


ccc 


4- /T" 


nnn 

yyy 


nnn 

ggg 


gec 


1 1 l 


all 


agg 


n 

y a t 


atg 


nnn 

yyy 


cac 


cct 


1152 


ca g 


yyt 




day 


cag 


ctt 


cca 


tec 


act 


ggt 


nnn 

ggt 


cac 


cca 


nnn 
yy t 


act 


get 


1200 


cag 


1 a 


gag 


a n /T 

agg 


Cud 


tec 


nnn 
tty 


yai 


a nn 
ay t 


tt 1 


4- 

tgg 


ggg 


nan 
t d y 


cca 


gee 


cct 


1248 




tgc 


cac 


ccc 


day 


na n 
Lay 


nnt 

ggt 


t rt 

It L 


gee 


ccc 


cag 


gag 


nnn 

ggc 


tgc 


age 


ccc 


1296 


CSC 


cca 


gca 


ntt 

gtt 


nnn 
y t 


cca 


tgc 


cct 


cct 


ggc 


t" nn 
1 1 1 


lit 


cct 


cca 


yyd 


tct 


1344 


tgc 


aaa 


gag 


gee 


nnn 


tta 


y la 


LlL. 


Ltd 


age 


1 1 1 


tt n 

lit 


t* ta 

iiy 


nna 
yyd 


cag 


ccc 


1392 


cag 


gca 


ccc 


cct 


gec 


cct 


gec 


aaa 


gca 


age 


ccc 


cca 


tt n 

ttg 


nan 
y d t 


tct 


aag 


1440 


at g 


ggg 


cct 


gga 


gac 


out 


1- nt- 

ni 


y-4- 4" 

1 1 1 


cct 


ggg 


agg 


cca 


aaa 


ccc 


nnn 
yyt 


ccc 


1488 


t gc 


a nt- 

agt 


tec 


cca 


ggg 


t na 

LCd 


gee 


tec 


cag 


gcg 


age 


t ct 


tec 


caa 


gtg 


age 


1536 


tec 


ct c 


agg 


gtg 


gg° 


tec 


tec 


cag 


gtg 


ggc 


aca 


gag 


cct 


ggc 


nnn 

1 1 1 


t nn 

L tt 


1584 


ct g 


gat 


gcg 


gag 


ggc 


tgg 


ace 


cag 


gag 


get 


gag 


gat 


ctg 


tec 


gac 


tec 


1 fi^9 

J. \J*J£. 


aca 


ccc 


ace 


ttg 


cag 


egg 


cct 


cag 


gaa 


cag 


gcg 


acc 


at g 


cgc 


aag 


lit 


J. DO \J 


tec 


ctg 


ggt 


ggt 


cgc 


ggg 


ggc 


tac 


gca 


ggc 


gtg 


get 


ggc 


•hat- 
id l 


ggc 


ace 


1728 


III 


gee 


4-4-4- 
ttt 


ggt 


gga 


gat 


gca 


ggg 


ggc 


ra 4- ft 

atg 


ctg 


ggg 


cag 


ggg 


ccc 


af- n 
at g 




tgg 


gee 


agg 


at a 


gec 


tgg 


get 


gtg 


tec 


cag 


4- 0 
tea 


gag 


gag 


gag 


gag 


cag 


1824 


gag 


gag 


gee 


agg 


get 


gag 


tec 


cag 


teg 


gag 


gag 


cag 


cag 


gag 


gee 


agg 


l R79 


get 


gag 


age 


cca 


ctg 


ccc 


cag 


gtc 


agt 


gca 


agg 


cct 


gtg 


cct 


gag 


gt c 


1 990 


ggc 


agg 


get 


ccc 


ace 


agg 


age 


1 1 1 


cca 


gag 


ccc 


acc 


cca 


tgg 


gag 


na n 
y d t 


1968 


at- n 


ggg 


cag 


gtc 


tec 


ctg 


gtg 


cag 


at C 


egg 


gac 


ctg 


4- n -x 

ltd 


ggt 


yd 1 


gcg 


2016 


gag 


gcg 


gee 


ga c 


aca 


-a 4- 

dud 


tec 


ctg 


gac 


d L I 


tec 


gag 


gtg 


gac 


CCC 


gec 


2064 


far 


etc 


aac 


etc 


4- ~ -a 
t Co 


gac 


ctg 




gat 


af r 
atC 


aag 


t a n 
Id t 


etc 


cca 


1 1 n 
lit 


gag 


2112 


t- 1* t- 


atg 


ate 


+- 4- /- 

tt c 


agg 


aaa 


gtc 


ccc 


aag 


tec 


get 


cag 


cca 


gag 


ccg 


ccc 


2160 


tec 


CCC 


atg 


get 


gag 


gag 


gag 


ctg 


gee 


gag 


ttc 


ccg 


gag 


ccc 


acg 


tgg 


2208 


ccc 


tgg 


cca 


ggt 


gaa 


ctg 


ggc 


ccc 


cac 


gca 


ggc 


ctg 


gag 


at* n 
d 1 1 


a na 
d td 


na n 
ydy 


2256 


gag 


tea 


gag 


gat 


gtg 


gac 


gcg 


ctg 


ctg 


gca 


gag 


get 


gee 


gtg 


ggc 


agg 


9"304 


aag 


cgc 


aag 


tgg 


tec 


teg 


ccg 


4- <-» =s 

tea 


cgc 


age 


etc 


ttc 


cac 


1- 1 n 

ttc 


cct 


ggg 


9 9 


agg 


cac 


ctg 


ccg 


ctg 


gat 


gag 


cct 


gca 


gag 


ctg 


ggg 


ctg 


cgt 


gag 


aga 


fl UU 


gtg 


aag 


gee 


tec 


gtg 


gag 


cac 


ate 


tec 


egg 


ate 


ctg 


aag 


ggc 


agg 


ccg 


9 4 4 R 


gaa 


ggt 


ctg 


gag 


aag 


gag 


ggg 


ccc 


ccc 


agg 


aag 


aag 


cca 


ggc 


nt- -fr- 
Ct t 


get 


1 1 J D 


tec 


1 1 c 


egg 


etc 


4- r- ^ 

tea 


ggt 


ctg 


aag 


age 


tgg 


gac 


cga 


gcg 


ccg 


aca 


Lit 


9^4 4 


eta 


agg 


gag 


etc 


4- -i 

tea 


gat 


gag 


act 


gtg 


gtc 


ctg 


ggc 


cag 


f* na 
l td 


gtg 


aca 




ctg 


gee 


tgc 


cag 


gtg 


tea 


gee 


cag 


cca 


get 


gec 


cag 


gee 


acc 


tgg 


age 


9 f>4 n 


aaa 


gac 


gga 


gee 


ccc 


ctg 


gag 


age 


age 


age 


cgt 


gtc 


etc 


afp 

d 1 1 


4- p.4- 
L 1 1 


gec 


96RR 


ace 


etc 


aag 


aac 


4- +- /— 

ttc 


cag 


ctt 


ctg 


acc 


ate 


ctg 


gtg 


gtg 


gtg 


get 


gag 


9 7 ^ 6 


gac 


ctg 


ggt 


gtg 


4- n 

tac 


ace 


tgc 


age 


gtg 


age 


aat 


gcg 


ctg 


ggg 


aca 


gtg 


97 R 4 


a cc 


ace 


acg 


ggc 


gtc 


ct c 


egg 


aag 


gca 


gag 


cgc 


ccc 


t na 
ltd 


tnt 
it 1 


t eg 


cca 


2832 


t gc 


ccg 


y-rn 4- 

gat 


at c 


ggg 


gag 


gtg 


4- -a i-i 

tac 


gcg 


gat 


ggg 


gtg 


ctg 


ctg 


gtc 


tgg 


9ft RO 


aag 


ccc 


gtg 


gaa 


tec 


tac 


ggc 


cct 


gtg 


acc 


4- /— 

tac 


= 4- 4> 

att 


gtg 


cag 


tgc 


age 


9 Q9R 


eta 


gaa 


ggc 


ggc 


age 


tgg 


acc 


aca 


ctg 


gee 


tec 


gac 


ate 


t tt 


gac 


tgc 


9 Q7 fi 


t gc 


tac 


ctg 


ace 


age 


aag 


etc 


tec 


egg 


ggt 


ggc 


acc 


1- a n 
I d t 


acc 


1 1 n 

lit 


cgc 




acg 


gca 


4- r^t 4- 

tgt 


gtc 


age 


aag 


gca 


gga 


=1 4- rr 

atg 


ggt 


ccc 


4- a r> 

tac 


age 


age 


ccc 


teg 


*W79 
J ly / £ 


gag 


caa 


gtc 


ct c 


ctg 


gga 


ggg 


ccc 


age 


ca c 


ctg 


gee 


4- ^-,4- 

1 1 1 


gag 


gag 


gag 


^1 90 


age 


cag 


ggg 


egg 


4- r* n 

tea 


gec 


caa 


ccc 


ctg 


ccc _ 


age 


aca 


aag 


acc 


4- 4- r- 
lit 




3168 


LUC 


cag 


aca 


cag 


ate 


cag 


agg 


ggc 


cgc 


ttc 


age 


gtg 


gtg 


nnn 
tgg 


caa 


tgc 


3216 


tgg 


gag 


aag 


gec 


age 


ggg 


egg 


gcg 


ctg 


gec 


nnn 

y -t 


aag 


ate 


ate 


ccc 


tac 


3264 


cac 


ccc 


aag 


gac 


aag 


aca 


gca 


gtg 


ctg 


cgc 


gaa 


tac 


gag 


gec 


etc 


aag 


331-2 


ggc 


ctg 


cgc 


cac 


ccg 


cac 


ctg 


gee 


cag 


ctg 


cac 


gca 


gec 


tac 


etc 


age 


3360 


ccc 


egg 


cac 


ctg 


gtg 


etc 


ate 


ttg 


gag 


ctg 


tgc 


tct 


ggg 


ccc 


gag 


ctg 


3408 


etc 


ccc 


tgc 


ctg 


gee 


gag 


agg 


gee 


tec 


tac 


tea 


gaa 


tec 


gag 


gtg 


aag 


3456 


gac 


tac 


ctg 


tgg 


cag 


atg 


ttg 


agt 


gee 


acc 


cag 


tac 


ctg 


cac 


aac 


cag 


3504 


cac 


ate 


ctg 


cac 


ctg 


gac 


ctg 


agg 


tec 


gag 


aac 


atg 


ate 


ate 


acc 


gaa 


3552 


tac 


aac 


ctg 


etc 


aag 


gtc 


gtg 


gac 


ctg 


ggc 


aat 


gca 


cag 


age 


etc 


age 


3600 


cag 


gag 


aag 


gtg 


ctg 


ccc 


tea 


gac 


aag 


ttc 


aag 


gac 


tac 


eta 


gag 


acc 


3648 


atg 


get 


cca 


gag 


etc 


ctg 


gag 


ggc 


.cag 


ggg 


get 


gtt 


cca 


cag 


aca 


gac 


3696 
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ate tgg gec ate ggt gtg aca gec ttc ate atg ctg age gee gag tac 3744 

ccg gtg age age gag ggt gca cgc gac ctg cag aga gga ctg cgc aag 
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210 










215 










220 










Val 


Leu 


His 


Leu 


Asp 


He 


Lys 


Pro 


.Ser 


Asn 


He 


Leu 


Met 


Val 


His 


Pro 
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225 




















235 










240 


Ala Arg Glu Asp 


lie 


Lys 


lie 


Cys 


Asp 


Phe 


Gly 


Phe 


Ala 


Gin 


Asn 


He 




















250 










255 




Thr 


Pro 


Ala 


Glu 


Leu 


bin 


rne 


Ser 


bin 


Tyr Gly Ser 


Pro 


Glu 


Phe 


Val 








260 










265 










270 






Ser 


Pro 


Glu 


lie 


lie 


bin 


bin 


Asn 


Pro 


Val 


Ser 


Glu 


Ala 


Ser Asp 


lie 






275 










280 










285 








Trp Ala Met 


Gly 


vai 


lie 


ber 


Tyr 


Leu 


Ser 


Leu 


Thr 


Cys 


Ser 


Ser 


Pro 




290 




















300 










Phe 


Ala 


Gly 


Glu 


C a -i- 


Asp 


Arg 


r\Xa. 


Thr 


Leu 


Leu 


Asn 


Val 


Leu Glu Gly 


305 








n n 










315 










320 


Arg 


Val 


Ser 


Trp 


Ser 


ber 


Pro 


Mer, 


Ala 


Ala 


His 


Leu 


Ser 


Glu Asp Ala 






*a o r 










330 










335 




Lys 


Asp 


Phe 


He 


Lys 


Ala 


OIL ~- 

i nr 


Leu 


Gin Arg Ala 


Pro Gin Ala Arg 


Pro 






340 










345 










350 






Ser 


Ala 


Ala 
355 


Gin 


Cys 


Leu 


Ser 


His 
360 


Pro 


Trp 


Phe 


Leu 


Lys 
365 


Ser 


Met 


Pro 


Ala 


Glu 


Glu 


Ala 


HIS 


rfle 


He 


Asn 


Thr 


Lys 


Gin 


Leu 


Lys 


Phe 


Leu 


Leu 




370 










375 








380 










Ala 


Arg 


Ser Arg 


Trp 


Ply, 

bin 


Arg 


Ser 


Leu 


Met 


Ser 


Tyr 


Lys 


Ser 


He 


Leu 


385 


















395 










400 


Val 


Met 


Arg 


Ser 


ti- 
ne 


Pro 


Glu 


Leu 


Leu Arg 


Gly 


Pro 


Pro Asp 


Ser 


Pro 








4 05 










410 










415 




Ser 


Leu Gly Val 


Ala 


Arg 


His Leu Cys Arg Asp Thr Gly Gly 


Ser 


Ser 








420 










425 










430 






Ser 


Ser 


Ser 
435 


Ser 


Ser 


Ser 


Asp 


Asn 
440 


Glu 


Leu 


Ala 


Pro 


Phe 
445 


Ala 


Arg 


Ala 


Lys 


Ser 


Leu 


Pro 


Pro 


Ser 


Pro 


Val 


Thr 


His 


Ser 


Pro 


Leu 


Leu 


His 


Pro 


450 










455 










460 










Arg 


Gly 


Phe 


Leu 


Arg 


Pro 


Ser 


Ala 


Ser 


Leu 


Pro 


Glu 


Glu 


Ala 


Glu 


Ala 


465 






470 










475 










480 


Ser 


Glu Arg 


Ser 


Thr 


Glu 


Ala 


Pro 


Ala 


Pro 


Pro 


Ala 


Ser 


Pro 


Glu 


Gly 










485 










490 










495 




Ala 


Gly 


Pro 


Pro 


Ala 


Ala 


Gin Gly 


Cys 


Val 


Pro Arg 


His 


Ser 


Val 


He 






500 










505 










510 






Arg 


Ser 


Leu 


Phe 


Tyr 


His 


Gin 


Ala 


Gly Glu 


Ser 


Pro 


Glu 


His 


Gly Ala 




515 










520 










525 








Leu 


Ala 


Pro 


Gly 


Ser Arg 


Arg 


His 


Pro 


Ala 


Arg 


Arg 


Arg 


His 


Leu 


Leu 




530 










535 










540 










Lys 


Gly 


Gly 


Tyr 


He 


Ala 


Gly Ala 


Leu 


Pro 


Gly Leu Arg 


Glu 


Pro 


Leu 


545 










550 










555 










5 60 


Met 


Glu 


His 


Arg 


Val 


Leu 


Glu 


Glu 


Glu 


Ala 


Ala 


Arg 


Glu 


Glu 


Gin 


Ala 








565 










570 










575 




Thr 


Leu 


Leu 


Ala 


Lys 


Ala 


Pro 


Ser 


Phe 


Glu 


Thr 


Ala 


Leu 


Arg 


Leu 


Pro 








580 








585 










590 






Ala 


Ser Gly Thr 


His 


Leu 


Ala 


Pro 


Gly 


His 


Ser 


His 


Ser 


Leu 


Glu 


His 






595 










600 










605 








Asp 


Ser 


Pro 


Ser 


Thr 


Pro 


Arg 


Pro 


Ser 


Ser Glu Ala Cys Gly Glu 


Ala 


610 










615 










620 










Gin Arg Leu 


Pro 


Ser 


Ala 


Pro Ser Gly Gly Ala 


Pro 


He Arg Asp 


Met 


625 










630 










635 










640 


Gly 


His 


Pro 


Gin 


Gly 


Ser 


Lys 


Gin 


Leu 


Pro Ser Thr Gly Gly 


His 


Pro 








645 










650 










655 




Gly 


Thr 


Ala 


Gin 


Pro 


Glu 


Arg 


Pro 


Ser 


Pro 


Asp 


Ser 


Pro Trp Gly Gin 






660 










665 










670 






Pro 


Ala 


Pro 


Phe 


Cys 


His 


Pro 


Lys 


Gin 


Gly 


Ser 


Ala 


Pro 


Gin 


Glu 


Gly 



675 680.. 685 
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Cys Ser 


Pro 


His 


Pro 


Ala 


Val 


Ala 


Pro 


Cys 


Pro 


Pro 


biy 




php 


C X \J 


con 










695 










7nn 

'UU 










Pro Gly 


Ser 


Cys 


Lys 


blU 


Ala 


Pro 


Leu 


vai 


Pro 




Cor 


Pro 


Phe 


Leu 


f\JD 






/ 1 U 










/ J. -J 










720 


Gly bin 


Pro 


bin 


/\ia 


Pro 


Pro 


Ala 


Pro 




Lys 


Al p 
/A J. a 


Ser 


Pro 


Pro 


Leu 






1 AO 










i ^n 










735 




Asp Ser 


Lys 


Met 


biy 


Pro 


Gly Asp 




Ser 


Leu 


c -L VJ 




Arg 


Pro 


Lys 












7 4 S 










750 






Pro Gly 


Pro 


Cys 


Ser 


Ser 


Pro Gly 


OCX 


Al a 


ber 


pi n 

bill 


nld 




Ser 


Ser 


"7 c, c. 










760 










765 








bin val 


C -»- 


ber 


Leu 


Arg 


Val 


Gly 


Ser 


Cor 




Val 


Gl v 


Thr 


Glu 


Pro 












775 










780 










Gly Pro 


Ser 


Leu 


Asp 


TV "1 _. 

Ala 


Glu Gly 


Trp 


i nr 


bin 


pi n 


nld 


OiU 


A Qn 


Leu 


785 








*7 nn 

7 90 










/ yo 










Ann 

0 u u 


Ser Asp 


Ser 


Thr 


Pro 


i nr 


Leu 


Gin 


Arg 


rro 


bin 


bJ.U 


pi n 




Thr 


Met 






one 










o i u 










OiJ 




Arg Lys 


Phe 


Ser 


Leu 


Gly 


Gly Arg 


biy 


biy 


Tyr 


Ala 


pit; 


val 


Al a 


Gly 




820 










825 










830 






Tyr Gly 


Thr 


Phe 


Ala 


Phe 


Gly Gly 


Asp 


Ala 


biy 


pi \r 
biy 


1*1 c U 


Leu 


uiy 






835 










840 










ft 4 S 

O H J 








Gly Pro 


Met 


Trp 


Ala 


Arg 


He 


Ala 


Trp 


TV 1 -» 

Ala 


vai 


C a -r 

ber 


bin 




Pin 


Glu 


850 










855 










oou 










Glu Glu 


Gin 


Glu 


Glu 


Ala 


Arg Ala 


blU 


Ser 


bin 


Ser 


pi n 

Ul u 


fill? 
bi li 


Pi n 


Gin 


8 65 








o "7 r\ 

870 










Q~7 c: 










O O \J 


Glu Ala 


Arg Ala 


Glu 


Ser 


Pro 


Leu 


Pro 


bin 


vai 


ber 


/iia 


Arg 


T3 -y- O 

IT X U 


Va 1 
vai 








885 










890 










895 




Pro Glu 


Val 


Gly 
900 


Arg 


Ala 


Pro 


Thr 


Arg 
905 


Ser 


ber 


Pro 


blU 


Pro 
910 


l nr 


Pro 


Trp Glu 


Asp 


He 


Gly 


Gin 


Val 


Ser 


Leu 


val 


bin 


Tin 

lie 


Arg 


Asp 


Leu 


OCi 


915 










920 


















Gly Asp 


Ala 


Glu 


TV T 

Ala 


TV "I _ 

Ala 


Asp 


Thr 


He 


C o -r- 

ber 


Leu 


Asp 


lie 


Ser 


wl Li 


Val 


930 










935 




















Asp Pro 


Ala 


Tyr 


Leu 


Asn 


Leu 


Ser 


Asp 


Leu 


Tyr 


Asp 


lie 


Lys 


T\7T 

i yx 


Leu 


945 








you 










^> j 










960 


Pro Phe 


Glu 


Phe 


Met 

n a n 

965 


He 


Phe 


Arg 


Lys 


v ai 

Q"7 A 

y / u 


Pro 


Lys 


Ser 


Ala 


Gin 
Q7 S 


Pro 


Glu Pro 


Pro 


Ser 
980 


Pro 


Met 


Ala 


Glu 


Glu 
985 


Glu 


Leu 


Ala 


Glu 


Phe 
990 


Pro 


Glu 


Pro Thr 


Trp 


Pro 


Trp 


Pro 


Gly 


Glu 


Leu 


Gly 


Pro 


His 


Ala 


Gly 


Leu 


Glu 




995 










1000 








1005 






He Thr 


Glu 


Glu 


Ser 


Glu 


Asp 


Val Asp 


Ala 


Leu 


Leu 


Ala. Glu 


Ala 


Ala 


1010 








1015 








1020 








Val Gly Arg 


Lys 


Arg 


Lys 


Trp 


Ser 


Ser 


Pro 


Ser 


Arg 


Ser 


Leu 


Phe 


His 


1025 








1030 








1035 








1040 


Phe Pro 


Gly Arg 


His 


Leu 


Pro 


Leu Asp 


Glu 


Pro 


Ala 


Glu 


Leu 


Gly 


Leu 








1045 








1050 








1055 


Arg Glu Arg 


Val 


Lys 


Ala 


Ser 


Val 


Glu 


His 


He 


Ser Arg 


lie 


Leu 


Lys 






1060 








1065 








1070 




Gly Arg 


Pro 


Glu 


Gly 


Leu 


Glu 


Lys 


Glu 


Gly 


Pro 


Pro 


Arg 


Lys 


Lys 


Pro 




1075 








1080 








1085 






Gly Leu 


Ala 


Ser 


Phe 


Arg 


Leu 


Ser 


Gly 


Leu 


Lys 


Ser 


Trp 


Asp 


Arg 


Ala 


1090 








1095 








1100 








. Pro Thr 


Phe 


Leu 


Arg 


Glu 


Leu 


Ser 


Asp 


Glu 


Thr 


Val 


Val 


Leu 


Gly 


Gin 


1105 






1110 








1115 








1120 


Ser Val 


Thr 


Leu 


Ala 


Cys 


Gin 


Val 


Ser 


Ala 


Gin 


Pro 


Ala 


Ala 


Gin 


Ala 








1125 








1130 








1135 


Thr Trp Ser Lys Asp Gly Ala 


Pro 


.Leu 


Glu 


Ser 


Ser 


Ser 


Arg 


Val 


Leu 
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1140 1145 1150 

lie Ser Ala Thr Leu Lys Asn Phe Gin Leu Leu Thr lie Leu Val Val 

1155 1160 1165 

Val Ala Glu Asp Leu Gly Val Tyr Thr Cys Ser Val Ser Asn Ala Leu 

1170 ~ 1175 1180 

Gly Thr Val Thr Thr Thr Gly Val Leu Arg Lys Ala Glu Arg Pro Ser 
1185 1190 1195 1200 

Ser Ser Pro Cys Pro Asp He Gly Glu Val Tyr Ala Asp Gly Val Leu 

1205 1210 1215 

Leu Val Trp Lys Pro Val Glu Ser Tyr Gly Pro Val Thr Tyr He Val 

1220 1225 1230 

Gin Cys Ser Leu Glu Gly Gly Ser Trp Thr Thr Leu Ala Ser Asp He 

1235 " 1240 1245 

Phe Asp Cys Cys Tyr Leu Thr Ser Lys Leu Ser Arg Gly Gly Thr Tyr 

1250 1255 1260 

Thr Phe Arg Thr Ala Cys Val Ser Lys Ala Gly Met Gly Pro Tyr Ser 
1265 " 1270 1275 1280 

Ser Pro Ser Glu Gin Val Leu Leu Gly Ala Pro Ser His Leu Ala Ser 

1285 1290 1295 

Glu Glu Glu Ser Gin Gly Arg Ser Ala Gin Pro Leu Pro Ser Thr Lys 

1300 1305 1310 

Thr Phe Ala Phe Gin Thr Gin He Gin Arg Gly Arg Phe Ser Val Val 

1315 1320 1325 

Arg Gin Cys Trp Glu Lys Ala Ser Gly Arg Ala Leu Ala Ala Lys He 

1330 1335 1340 

He Pro Tyr His Pro Lys Asp Lys Thr Ala Val Leu Arg Glu Tyr Glu 
1345 1350 1355 1360 

Ala Leu Lys Gly Leu Arg His Pro His Leu Ala Gin Leu His Ala Ala 

1365 1370 1375 

Tyr Leu Ser Pro Arg His Leu Val Leu He Leu Glu Leu Cys Ser Gly 

1380 1385 1390 

Pro Glu Leu Leu Pro Cys Leu Ala Glu Arg Ala Ser Tyr Ser Glu Ser 

1395 1400 1405 

Glu Val Lys Asp Tyr Leu Trp Gin Met Leu Ser Ala Thr Gin Tyr Leu 

1410 1415 1420 

His Asn Gin His lie Leu His Leu Asp Leu Arg Ser Glu Asn Met lie 
1425 1430 1435 1440 

lie Thr Glu Tyr Asn Leu Leu Lys Val Val Asp Leu Gly Asn Ala Gin 

1445 1450 1455 

Ser Leu Ser Gin Glu Lys Val Leu Pro Ser Asp Lys Phe Lys Asp Tyr 

1460 J 1465 1470 

Leu Glu Thr Met Ala Pro Glu Leu Leu Glu Gly Gin Gly Ala Val Pro 

1475 1480 1485 

Gin Thr Asp He Trp Ala He Gly Val Thr Ala Phe lie Met Leu Ser 

1490 1495 1500 

Ala Glu Tyr Pro Val Ser Ser Glu Gly Ala Arg Asp Leu Gin Arg Gly 
1505 1510 1515 1520 

Leu Arg Lys Gly Leu Val Arg Leu Ser Arg Cys Tyr Ala Gly Leu Ser 

1525 1530 1535 

Gly Gly Ala Val Ala Phe Leu Arg Ser Thr Leu Cys Ala Gin Pro Trp 

1540 1545 1550 

Gly Arg Pro Cys Ala Ser Ser Cys Leu Gin Cys Pro Trp Leu Thr Glu 

1555 1560 1565 

Glu Gly Pro Ala Cys Ser Arg Pro Ala Pro Val Thr Phe Pro Thr Ala 

1570 * 1575 1580 

Arg Leu Arg Val Phe Val Arg Asn Arg Glu Lys Arc Arg Ala Leu Leu 
1585 1590 . 1595 160C 



11 



WO 00/63381 



PCT/US00/09488 



Tyr Lys Arg His Asn Leu Ala Gin Val Arg 
1605 1610 



107 
155 
203 



<210> 5 
<211> 7928 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 

<222> (60) . . . (7847) 
<4 00> 5 

gaattcggcg cgccagatat cacacgtgcc aaggggctgg ctcactggtg ccgccccga 59 
atg ctg gag agg ttc acc ccc aag aaa gtg aag aaa ggc tec age ate 
ace ttc tct gtg aag gta gaa gga cgc ccg gtg ccc acc gtg cac tgg 
etc agg gag gag get gag aga ggc gtg ctg tgg att ggc cct gac aca 
ccg ggc tac acc gtg gee age tct gcg cag cag cac age ctg gtc ctg 251 
ctg gac gtg ggc egg cag cac cag ggc acc tac aca tgc att gee age 299 
aac get gee ggc cag gee etc tgc tec. gee age ctg cac gtc teg ggc 347 
ctg cct aag gtg gag gag cag gag aaa gtg aag gaa gcg ctg att tec 395 
act ttc ctg cag ggg acc aca caa gee ate tea gca cag ggg ttg gaa 443 
act gcg agt ttt get gac ctt ggt ggg cag agg aaa gaa gag cct ctg 491 
get gec aag gag gee etc ggc cac ctg tec etc get gag gtg ggc aca 
gag gag ttc ctg cag aaa ctg acc tec cag ate act gag atg gta teg 
gee aag ate acg cag gec aag ctg cag gtg cca gga ggt gac agt gat 
gag gac tec aag aca cca tct gca tec ccc cgc cat ggc cga tea egg 
cca tec tec age ate cag gag tct tec tea gag tea gag gac ggc gat 
gec cga ggc gag ate ttt gac ate tac gtg gtc' acc get gac tac ctg 779 
ccc eta ggg get gag cag gat gee ate acg ctg egg gaa ggc cag tat 
gtg gag gtc ctg gat gca gee cac cca ctg cgc tgg ctt gtc cgc acc 
aag ccc acc aag tec age ccc tea egg cag ggc tgg gtg tea cca gee 



539 
587 
635 
683 
7 31 



827 
875 
923 



tac ctg gac agg agg etc aag ctg tea cct gag tgg ggg gee get gag 971 
gec cct gag ttc cct ggg gag get gtg tct gaa gac gaa tac aag gca 1019 
agg ctg age tct gtg ate cag gag ctg ctg agt tct gag cag gee ttc 1067 
gtg gag gag ctg cag ttc ctg cag age cac cac ctg cag cac ctg gag 1115 
cgc tgc ccc cac gtg ccc ata gee gtg gee ggc cag aag gca gtc ate 1163 
ttc cgc aat gtg egg gac ate ggc cgc ttc cac age age ttc ctg cag 1211 
gag ttg cag cag tgc gac acg gac gac gac gtg gec atg tgc ttc ate 1259 
aag aac cag gcg gee ttt gag cag tac ctg gag ttc ctg gtg ggg cgt 1307 
gtg cag get gag teg gtg gtc gtc age acg gee ate cag gag ttc tac 1355 
aag aaa tac gcg gag gag gee ctg ttg gca ggg gac ccc tct cag ccc 
ccg cca cca cct ctg cag cac tac ctg gag cag cca gtg gag egg gtg 
cag cgc tac cag gee ttg ctg aag gag ttg ate cgc aac aag gcg egg 
aac aga cag aac tgc gcg ctg ctg gag cag gee tat gee gtg gtg tct 1547 
gee ctg cca cag cgc get gag. aac aag ctg cac gtg tec etc atg gag 1595 
aac tac cca ggc acc ctg gag gee ctg ggc gag ccc ate cgc cag ggc 
cac ttc ate gtg tgg gag ggt gca ccg ggg gee cgc atg ccc tgg aag 
ggc cac aac cgt cac gtg ttc etc. .ttc cgc aac cac ctg gta ate tgc 



1403 
1451 
1499 



1643 
1691 
1739 
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aag ccc egg cga gac tec cgc acc gat acc gtc age tac gtg ttc egg 1787 

aac atg atg aag ctg age age ate gac ctg aac gac cag gtg gag ggg 1835 

gat gac cgc gec ttc gag gtg tgg cag gag egg gag gac teg gtg cgc 1883 

aag tac ctg ctg cag gca egg aca gee att ate aag age teg tgg gtg 1931 

aag gag ate tgt ggc ate cag cag cgt ctg gee ctg cct gtg tgg egg 1979 

ccc ccg gac ttt gaa gag gag ctg gee gac tgc aca gee gag ctg ggt 2027 

gag aca gtc aag ctg gee tgc cgc gtg acg ggc aca ccc aag cct gtc 2075 

ate age. tgg tac aaa gat ggg aaa gca gtg cag gtg gac ccc cac cac 2123 

ate etc att gaa gac cct gat ggc teg tgt gca etc ate ctg gac age 2171 

ctg acc ggt gtg gac tct ggc cag tac atg tgc ttc gcg gee age gee 2219 

get ggc aac tgc agt acc ctg ggc aag ate ctg gtg caa gtc cca cca 2267 

egg ttc gtg aac aag gtc egg gee tea ccc ttt gtg gag gga gag gac 2315 

gee cag ttc acc tgc acc ate gaa ggc gee ccg tac ccg cag ate agg 2363 

tgg tac aag gac ggg gee ctg ctg acc act ggc aac aag ttc. cag aca 2411 

ctg agt gag cct cgc age ggc ctg eta gtg ctg gtg ate egg gcg gee 2459 

age aag gag gac ctg ggg etc tac gag tgt gag ctg gtg aac egg ctg 2507 

ggc tec gcg egg get agt gcg gag ctg cgc att cag age ccc atg ctg 2555 

cag gee cag gag cag tgt cac agg gag cag etc gtg get gca gtg gaa 2603 

gac acc acc ctg gag cga gcg gac cag gag gtc aca tct gtc ctg aag 2651 

aga ctg ctg ggc ccc aag gcg cca ggc ccc tec aca ggg gac etc act 2699 

ggc cct ggc ccc tgc ccc agg ggg gca ccc gca etc cag gaa acc ggc 2747 

tec cag ccc cca gtc acc gga act teg gag gca cct gee gtg ccc ccg 27 95 

agg gtg cca cag ccc etc etc cac gaa ggc cca gag cag gag ccg gag 2843 

gee att gee aga gee cag gaa tgg act gtg ccc att egg atg gag ggt 2891 

gca gee tgg ccc ggg gca ggc aca ggg gag ctg etc tgg gac gtc cac .2939 

age cac gtg gtc aga gag acc aca cag agg acc tac aca tac cag gee 2987 

ate gac acg cac acc gca egg ccc cca tec atg cag gta acc ate gag 3035 

gat gtg cag gca cag aca ggc gga acg gee caa ttc gag get ate att 3083 

gag ggc gac cca cag ccc teg gtg acc tgg tac aag gac age gtc cag 3131 

ctg gtg gac age acc egg ctt age cag cag caa gaa ggc acc aca tac 3179 

tec ctg gtg ctg agg cat gtg gee teg aag gat gee ggc gtt tac acc 3227 
tgc ctg gee caa aac act ggt ggc cag gtg etc tgc aag gca gag ctg • 3275 

ctg gtg ctt ggg ggg gac aat gag ccg gac tea gag aag caa age cac 3323 

egg agg aag ctg cac tec ttc tat gag gtc aag gag gag att gga agg 3371 

ggc gtg ttt ggc ttc gta aaa aga gtg cag cac aaa gga aac aag ate 3419 

ttg tgc get gee aag ttc ate ccc eta egg age aga act egg gee cag 3467 

gca tac agg gag cga gac ate ctg gee gcg ctg age cac ccg ctg gtc 3515 

acg ggg ctg ctg gac cag ttt gag acc cgc aag acc etc ate etc ate 3563 

ctg gag ctg tgc tea tec gag gag ctg ctg gac cgc ctg tac agg aag 3611 

ggc gtg gtg acg gag gee gag gtc aag gtc tac ate cag cag ctg gtg 3659 

gag ggg ctg cac tac ctg cac age cat ggc gtt etc cac ctg gac ata 3707 

aag ccc tct aac ate ctg atg gtg cat cct gee egg gaa gac att aaa 3755 

ate tgc gac ttt ggc ttt gee cag aac ate acc cca gca gag ctg cag 3803 

ttc age cag tac ggc tec cct gag ttc gtc tec ccc gag ate ate cag 3851 

cag aac cct gtg age gaa gee tec gac att tgg gee atg ggt gtc ate. 3899 

tec tac etc age ctg acc tgc tea tec cca ttt gee ggc gag agt gac 3947 

cgt gee acc etc ctg aac gtc ctg gag ggg cgc gtg tea tgg age age 3995 

ccc atg get gee cac etc age gaa gac gee aaa gac ttc ate aag get 4043 

acg ctg cag aga gee cct cag gee egg cct agt gcg gee cag tgc etc 4091 

tec cac ccc tgg ttc ctg aaa tec atg cct gcg gag gag gee cac ttc 4139 

ate aac. acc aag cag ere aag ttc etc ctg gee cga agt cgc tgg cag 4187 

cgt tec ctg atg age tac aag tec ate ctg gtg atg cgc tec ate cct 4235 

gag ctg ctg egg ggc cca ccc gac age ccc tec etc ggc gta gee egg 4283 

cac etc tgc agg gac act ggt ggc tec tec agt tec tec tec tec tct 4331 

gac aac gag etc gee cca ttt gee egg get aag tea ctg cca ccc tec 4379 

ccg gtg aca cac tea cca ctg ctg cac ccc egg ggc ttc ctg egg ccc 4427 

teg gee age ctg cct gag gaa gee. .gag gee agt gag cgc tec acc gag 4475 
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Thr Cys Thr 


He 


Glu 


Gly 


Ala 


Pro 


Tyr 


Pro 


Gin 


lie 


Arc 


755 






760 










765 








Trp Tyr Lys 


Asp Gly Ala 


Leu 


Leu 


Thr 


Thr 


Gly 


Asn 


Lys 


Phe 


Gin 


Thr 


770 




775 










780 










Leu Ser Glu 


Pro Arg Ser 


Gly 


Leu 


Leu 


Val 


Leu 


Val 


lie 


Arg Ala 


Ala 


785 


790 










795 










800 


Ser Lys Glu Asp Leu Gly 


Leu 


Tyr 


Glu 


Cys 


Glu 


Leu 


Val 


Asn Arg 


Leu 




805 








810 










815 




Gly Ser Ala Arg Ala Ser 


Ala 


Glu 


Leu 


Arg 


lie 


Gin 


Ser 


Pro 


Met 


Leu 




820 






825 










830 






Gin Ala Gin 


Glu Gin Cys 


His 


Arg 


Glu 


Gin 


Leu 


Val 


Ala 


Ala 


Val 


Glu 


835 






840 










845 








Asp Thr Thr 


Leu Glu Arg 


Ala 


Asp 


Gin 


Glu 


Val 


Thr 


Ser 


Val 


Leu 


Lys 


850 




855 










860 










Arg Leu Leu 


Gly Pro Lys 


Ala 


Pro 


Gly 


Pro 


Ser 


Thr 


Gly 


Asp 


Leu 


Thr 


865 


870 










875 










880 


Gly Pro Gly 


Pro Cys Pro 
885 


Arg 


Gly 


Ala 


Pro 
890 


Ala 


Leu 


Gin 


Glu 


Thr 
895 


Gly 


Ser Gin Pro 


Pro Val Thr 


Gly 


Thr 


Ser 


Glu 


Ala 


Pro 


Ala 


Val 


Pro 


Pro 




900 




905 










910 






Arg Val Pro 


Gin Pro Leu 


Leu 


His 


Glu 


Gly 


Pro 


Glu 


Gin 


Glu 


Pro 


Glu 


915 






920 










925 








Ala He Ala Arg Ala Gin 


Glu 


Trp 


Thr 


Val 


Pro 


He Arg 


Met 


Glu 


Gly 


930 




935 










940 










Ala Ala Trp Pro Gly Ala 


Gly 


Thr 


Gly 


Glu 


Leu 


Leu 


Trp 


Asp Val 


His 


945 


950 










955 










960 


Ser His Val 


Val Arg Glu 
965 


Thr 


Thr 


Gin 


Arg 
970 


Thr 


Tyr 


Thr 


Tyr 


Gin 
975 


Ala 


He Asp Thr 


His Thr Ala 


Arg 


Pro 


Pro 


Ser 


Met 


Gin 


Val 


Thr 


He 


Glu 


980 






985 










990 






Asp Val Gin 


Ala Gin Thr 


Gly 


Gly 


Thr 


Ala 


Gin 


Phe 


Glu 


Ala 


lie 


lie 


995 






1000 








1005 






Glu Gly Asp 


Pro Gin Pro 


Ser 


Val 


Thr 


Trp 


Tyr 


Lys 


Asp 


Ser 


Val 


Gin 


1010 




1015 








1020 








Leu Val Asp 


Ser Thr Arg 


Leu 


Ser 


Gin 


Gin 


Gin 


Glu 


Gly 


Thr 


Thr 


Tyr 


1025 


1030 








1035 








1040 


Ser Leu Val 


Leu Arg His 


Val 


Ala 


Ser 


Lys 


Asp Ala Gly Val 


Tyr 


Thr 




1045 








1050 








1055 


Cys Leu Ala 


Gin Asn Thr Gly Gly 


Gin 


Val 


Leu 


Cys 


Lys 


Ala 


Glu 


Leu 


1060 






1065 








1070 




Leu Val Leu 


Gly Gly Asp Asn 


Glu 


Pro Asp 


Ser 


Glu 


Lys 


Gin 


Ser 


His 


.1075 




1080 








1085 






Arg Arg Lys 


Leu His Ser 


Phe 


Tyr 


Glu 


Val 


Lys 


Glu 


Glu 


He 


Gly Arg 


1090 




1095 








1100 








Gly Val Phe 


Gly Phe Val 


Lys 


Arg Val 


Gin 


His 


Lys 


Gly 


Asn 


Lys 


lie 


.1105 


1110 








1115 








1120 


Leu Cys Ala 


Ala Lys Phe 


He 


Pro 


Leu 


Arg 


Ser 


Arg 


Thr 


Arg Ala 


Gin 


1125 








1130 








1135 


Ala Tyr Arg 


Glu Arg Asp 


He 


Leu 


Ala 


Ala 


Leu 


Ser 


His 


Pro 


Leu 


Val 


1140 






.1145 ' 








1150 
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Thr Gly Leu Leu Asp Gin Phe Glu Thr Arg Lys Thr Leu lie Leu lie 

1155 ■ 1160 1165 

Leu Glu Leu Cys Ser Ser Glu Glu Leu Leu Asp Arg Leu Tyr Arg Lys 

1170 1175 1180 

Gly Val Val Thr Glu Ala Glu Val Lys Val Tyr He Gin Gin Leu Val 
1185 1190 1195 1200 

Glu Gly Leu His Tyr Leu His Ser His Gly Val Leu His Leu Asp He 

1205 1210 1215 

Lys Pro Ser Asn He Leu Met Val His Pro Ala Arg Glu Asp He Lys 

1220 1225 1230 

He Cys Asp Phe Gly Phe Ala Gin Asn He Thr Pro Ala Glu Leu Gin 

1235 1240 1245 

Phe Ser Gin Tyr Gly Ser Pro Glu Phe Val Ser Pro Glu He He Gin 

1250 1255 1260 

Gin Asn Pro Val Ser Glu Ala Ser Asp He Trp Ala Met Gly Val He 
1265 1270 1275 1280 

Ser Tyr Leu Ser Leu Thr Cys Ser Ser Pro Phe Ala Gly Glu Ser Asp 

1285 1290 1295 

Arg Ala Thr Leu Leu Asn Val Leu Glu Gly Arg Val Ser Trp Ser Ser 

1300 1305 1310 

Pro Met Ala Ala His Leu Ser Glu Asp Ala Lys Asp Phe He Lys Ala 

1315 1320 1325 

Thr Leu Gin Arg Ala Pro Gin Ala Arg Pro Ser Ala Ala Gin Cys Leu 

1330 1335 1340 

Ser His Pro Trp Phe Leu Lys Ser Met Pro Ala Glu Glu Ala His Phe 
1345 1350 1355 1360 

He Asn Thr Lys Gin Leu Lys Phe Leu Leu Ala Arg Ser Arg Trp Gin 

1365 1370 1375 

Arg Ser Leu Met Ser Tyr Lys Ser He Leu Val Met Arg Ser lie Pro 

1380 1385 1390 

Glu Leu Leu Arg Gly Pro Pro Asp Ser Pro Ser Leu Gly Val Ala Arg 

1395 1400 1405 

His Leu Cys Arg Asp Thr Gly Gly Ser Ser Ser Ser Ser Ser Ser Ser 

1410 1415 1420 

Asp Asn Glu Leu Ala Pro Phe Ala Arg Ala Lys Ser Leu Pro Pro Ser 
1425 1430 1435 1440 

Pro Val Thr His Ser. Pro Leu Leu His Pro Arg Gly Phe Leu Arg Pro 

1445 145C 1455 

Ser Ala Ser Leu Pro Glu Glu Ala Glu Ala Ser Glu Arg Ser Thr Glu 

1460 1465 1470 

Ala Pro Ala Pro Pro Ala Ser Pro Glu Gly Ala Gly Pro Pro Ala Ala 

1475 1480 1485 

Gin Gly Cys Val Pro Arg His Ser Val lie Arg Ser Leu Phe Tyr His 

1490 1495 - 1500 

Gin Ala Gly Glu Ser Pro Glu His Gly Ala Leu Ala Pro Gly Ser Arg 
1505 ~ 1510 1515 1520 

Arg His Pro Ala Arg Arg Arg His Leu Leu Lys Gly Gly Tyr He Ala 

1525 1530 1535 

Gly Ala Leu Pro Gly Leu Arg Glu Pro Leu Met Glu His Arg Val Leu 

1540 " 1545 1550 

Glu Glu Glu Ala Ala Arg Glu Glu Gin Ala Thr Leu Leu Ala Lys Ala 

1555 1560 1565 

Pro Ser Phe Glu Thr Ala Leu Arg Leu Pro Ala Ser Gly Thr His Leu 

1570 1575 • 1580 

Ala Pro Gly His Ser His Ser Leu Glu His Asp Ser Pro Ser Thr Pro 
1585 1590 ■ 1595 1600 

Arc Pro Ser Ser Glu Ala Cys Gly.JGlu Ala Gin Arg Leu Pro Ser Ala 
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1605 1610 " 1615 

Pro Ser Gly Gly Ala Pro He Arg Asp Met Gly His Pro Gin Gly Ser 

1620 1625 1630 

Lys Gin Leu Pro Ser Thr Gly Gly His Pro Gly Thr Ala Gin Pro Glu 

1635 1640 1645 

Arg Pro Ser Pro Asp Ser Pro Trp Gly Gin Pro Ala Pro Phe Cys His 

1650 1655 1660 . 

Pro Lys Gin Gly Ser Ala Pro Gin Glu Gly Cys Ser Pro His Pro Ala 
1665 ' 1670 1675 1680 

Val Ala Pro Cys Pro Pro Gly Ser Phe Pro Pro Gly Ser Cys Lys Glu 

1685 1690 1695 

Ala Pro Leu Val Pro Ser Ser Pro Phe Leu Gly Gin Pro Gin Ala Pro 

1700 1705 . 1710 

Pro Ala Pro Ala Lys Ala Ser Pro Pro Leu Asp Ser Lys Met Gly Pro 

1715 1720 1725 

Gly Asp He Ser Leu Pro Gly Arg Pro Lys Pro Gly Pro Cys Ser Ser 

1730 1735 1740 

Pro Gly Ser Ala Ser Gin Ala Ser Ser Ser Gin Val Ser Ser Leu Arg 
1745 1750 1755 1760 

Val Gly Ser Ser Gin Val Gly Thr Glu Pro Gly Pro Ser Leu Asp Ala 

1765 1770 1775 

Glu Gly Trp Thr Gin Glu Ala Glu Asp Leu Ser Asp Ser Thr Pro Thr 

1780 1785 1790 

Leu Gin Arg Pro Gin Glu Gin Ala Thr Met Arg Lys Phe Ser Leu Gly 

1795 1800 1805 

Gly Arg Gly Gly Tyr Ala Gly Val Ala Gly Tyr Gly Thr Phe Ala Phe 

1810 " 1815 1820 

Gly Gly Asp Ala Gly Gly Met Leu Gly Gin Gly Pro Met Trp Ala Arg 
1825 L 1830 1835 1840 

He Ala Trp Ala Val Ser Gin Ser Glu Glu Glu Glu Gin Glu Glu Ala 

1845 1850 1855 

Arg Ala Glu Ser Gin Ser Glu Glu Gin Gin Glu Ala Arg Ala Glu Ser 

1860 1865 1870 

Pro Leu Pro Gin Val Ser Ala Arg Pro Val Pro Glu Val Gly Arg Ala 

1875 1880 1885 

Pro Thr Arg Ser Ser Pro Glu Pro Thr Pro Trp Glu Asp He Gly Gin 

1890 1895 1900 

Val Ser Leu Val Gin He Arg Asp Leu Ser Gly Asp Ala Glu Ala Ala 
1905 1910 1915 1920 

Asp Thr He Ser Leu Asp He Ser Glu Val Asp Pro Ala Tyr Leu Asn 

1925 1930 1935 

Leu Ser Asp Leu Tyr Asp He Lys Tyr Leu Pro Phe Glu Phe Met He 

1940 1945 1950 

Phe Arg Lys Val Pro Lys Ser Ala Gin Pro Glu Pro Pro Ser Pro Met 

1955 I960 1965 

Ala Glu Glu Glu Leu Ala Glu Phe Pro Glu Pro Thr Trp Pro Trp Pro 

1970 1975 1980 

Gly Glu Leu Gly Pro His Ala Gly Leu Glu He Thr Glu Glu Ser Glu 
1985 J 1990 1995 2000 

Asp Val Asp Ala Leu Leu Ala Glu Ala Ala Val Gly Arg Lys Arg Lys 

2005 2010 2015 

Trp Ser Ser Pro Ser Arg Ser Leu Phe His Phe Pro Gly Arg His Leu 

. 2020 2025 2030 

Pro Leu Asp Glu Pro Ala Glu Leu Gly Leu Arg Glu Arg Val Lys Ala 

2035 2040 2045 

Ser Val Glu His He Ser Arg He Leu Lys Gly Arg Pro Glu Gly Leu 
2050 2055 206.0 
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Glu Lys Glu Gly Pro Pro Arg Lys Lys Pro Gly Leu Ala Ser Phe Arg 
2065 2070 2075 2080 

Leu Ser Gly Leu Lys Ser Trp Asp Arg Ala Pro Thr Phe Leu Arg Glu 

2085 2090 2095 

Leu Ser Asp Glu Thr Val Val Leu Gly Gin Ser Val Thr Leu Ala Cys 

2100 2105 . 2110 

Gin Val Ser Ala Gin Pro Ala Ala Gin. Ala Thr Trp Ser Lys Asp Gly 

2115 2120 2125 

Ala Pro Leu Glu Ser Ser Ser Arg Val Leu lie Ser Ala Thr Leu Lys 

2130 2135 2140 

Asn Phe Gin Leu Leu Thr He Leu Val Val Val Ala Glu Asp Leu Gly 
2145 2150 2155 2160 

Val Tyr Thr Cys Ser Val Ser Asn Ala Leu Gly Thr Val Thr Thr Thr 

2165 2170 2175 

Gly Val Leu Arg Lys Ala Glu Arg Pro Ser Ser Ser Pro Cys Pro Asp 

2180 2185 2190 

He Gly Glu Val Tyr Ala Asp Gly Val Leu Leu Val Trp Lys Pro Val 

2195 2200 2205 

Glu Ser Tyr Gly Pro Val Thr Tyr He Val Gin Cys Ser Leu Glu Gly 

2210 2215 2220 

Gly Ser Trp Thr Thr Leu Ala Ser Asp He Phe Asp Cys Cys Tyr Leu 
2225 ~ 2230 2235 2240 

Thr Ser Lys Leu Ser Arg Gly Gly Thr Tyr Thr Phe Arg Thr Ala Cys 

2245 2250 2255 

Val Ser Lys Ala Gly Met Gly Pro Tyr Ser Ser Pro Ser Glu Gin Val 

2260 2265 2270 

Leu Leu Gly Ala Pro Ser His Leu Ala Ser Glu Glu Glu Ser Gin Gly 

2275 2280 2285 

Arg Ser Ala Gin Pro Leu Pro Ser Thr Lys Thr Phe Ala Phe Gin Thr 

2290 2295 2300 

Gin He Gin Arg Gly Arg Phe Ser Val Val Arg Gin Cys Trp Glu Lys 
2305 2310 2315 2320 

Ala Ser Gly Arg Ala Leu Ala Ala Lys He He Pro Tyr His Pro Lys 

2325 2330 2335 

Asp Lys Thr Ala Val Leu Arg Glu Tyr Glu Ala Leu Lys Gly Leu Arg 

2340 2345 2350 

His Pro His Leu Ala Gin Leu His Ala Ala Tyr Leu Ser Pro Arg His 

2355 2360 2365 

Leu Val Leu He Leu Glu Leu Cys Ser Gly Pro Glu Leu Leu Pro Cys 

2370 2375 2380 

Leu Ala Glu Arg Ala Ser Tyr Ser Glu Ser Glu Val Lys Asp Tyr Leu 
2385 ^ 2390 2395 2400 

Trp Gin Met Leu Ser Ala Thr Gin Tyr Leu His Asn Gin His He Leu 

2405 2410 2415 

His Leu Asp Leu Arg Ser Glu Asn Met He He Thr Glu Tyr Asn Leu 

2420 2425 2430 

Leu Lys Val Val Asp Leu Gly Asn Ala Gin Ser Leu Ser Gin Glu Lys 

2435 ~ 2440 2445 

Val Leu Pro Ser Asp Lys Phe Lys Asp Tyr Leu Glu Thr Met Ala Pro 

2450 2455 2460 

Glu Leu Leu Glu Gly Gin Gly Ala Val Pro Gin Thr Asp He Trp Ala 
2465 2470 2475 2480 

lie Gly Val Thr Ala Phe He Met Leu Ser Ala Glu Tyr Pro Val Ser 

2485 2490 2495 

Ser Glu Gly Ala Arg Asp Leu Gin Arg Gly Leu Arg Lys Gly Leu Val 

2500 2505 2510 

Arg Leu Ser Arg Cys Tyr Ala Gly.JLeu Ser Gly Gly Ala Val Ala Phe 
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2515 2520 2525 

Leu Arg Ser Thr Leu Cys Ala Gin Pro Trp Gly Arg Pro Cys Ala Ser 

2530 2535 2540 

Ser Cys Leu Gin Cys Pro Trp Leu Thr Glu Glu Gly Pro Ala Cys Ser 
2545 2550 2555 2560 

Arg Pro Ala Pro Val Thr Phe Pro Thr Ala Arg Leu Arg Val Phe Val 

2565 2570 2575 

Arg Asn Arg Glu Lys Arg Arg Ala Leu Leu Tyr Lys Arg His Asn Leu 

2580 2585 2590 

Ala Gin Val Arg 
2595 



<210> 7 
<211> 871 
<212> PRT 

<213> Homo sapiens 



<400> 7 



Met 


Gly 


Pro 


Gly 


Asp 


He 


Ser 


Leu 


Pro 


Gly 


Arg 


Pro 


Lys 


Pro 


Gly 


Pro 


1 




5 










10 










15 




Cys 


Ser 


Ser 


Pro 


Gly 


Ser 


Ala 


Ser 


Gin 


Ala 


Ser 


Ser 


Ser 


Gin 


Val 


Ser 






20 








25 










30 






Ser 


Leu 


Arg 
35 


Val 


Gly 


Ser 


Ser 


Gin 
40 


Val 


Gly 


Thr 


Glu 


Pro 
45 


Gly 


Pro 


Ser 


Leu 


Asp 
50 


Ala 


Glu 


Gly 


Trp 


Thr 
55 


Gin 


Glu 


Ala 


Glu 


Asp 
60 


Leu 


Ser 


Asp 


Ser 


Thr 


Pro 


Thr 


Leu 


Gin 


Arg 


Pro 


Gin 


Glu 


Gin 


Ala 


Thr 


Met 


Arg 


Lys 


Phe 


65 










70 










75 










80 


Ser 


Leu 


Gly 


Gly 


Arg 


Gly 


Gly Tyr Ala 


Gly 


Val 


Ala 


Gly 


Tyr 


Gly 


Thr 










85 










90 










95 




Phe 


Ala 


Phe 


Gly 


Gly 


Asp 


Ala 


Gly 


Gly 


Met 


Leu Gly Gin 


Gly 


Pro 


Met 








100 










105 










110 






Trp 


Ala 


Arg 


He 


Ala 


Trp Ala 


Val 


Ser 


Gin 


Ser 


Glu 


Glu 


Glu 


Glu 


Gin 




115 










120 










125 








Glu 


Glu 


Ala 


Arg 


Ala 


Glu 


Ser 


Gin 


Ser 


Glu 


Glu 


Gin 


Gin 


Glu 


Ala 


Arg 




130 








135 










140 










Ala 


Glu 


Ser 


Pro 


Leu 


Pro 


Gin 


Val 


Ser 


Ala 


Arg 


Pro 


Val 


Pro 


Glu 


Val 


145 










150 










155 










160 


Gly 


Arg 


Ala 


Pro 


Thr 


Arg 


Ser 


Ser 


Pro 


Glu 


Pro 


Thr 


Pro 


Trp 


Glu 


Asp 






165 










170 










175 




lie 


Gly 


Gin 


Val 


Ser 


Leu 


Val 


Gin 


He 


Arg 


Asp 


Leu 


Ser 


Gly 


Asp 


Ala 






180 










185 










190 






Glu 


Ala 


Ala 


Asp 


Thr 


He 


Ser 


Leu Asp 


He 


Ser 


Glu 


Val 


Asp 


Pro 


Ala 






195 








200 










205 








Tyr 


Leu 


Asn 


Leu 


Ser 


Asp 


Leu 


Tyr Asp 


He 


Lys 


Tyr 


Leu 


Pro 


Phe 


Glu 


210 










215 










220 










Phe 


Met 


lie 


Phe 


Arg 


Lys 


Val 


Pro 


Lys 


Ser 


Ala 


Gin 


Pro 


Glu 


Pro 


Pro 


225 








230 










235 










240 


Ser 


Pro 


Met 


Ala 


Glu 


Glu 


Glu 


Leu 


.Ala 


Glu 


Phe 


Pro 


Glu 


Pro 


Thr 


Trp 
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Pro 


Trp 


Pro 


Gly 








260 


Glu 


Ser 


Glu 


Asp 






275 




Lys 


Arg ■ 


Lys 


Trp 




290 






Arg 


His 


Leu 


Pro 


305 








Val 


Lys 


Ala 


Ser 


Glu Gly 


Leu 


Glu 








340 


Ser 


Phe 


Arg 


Leu 






355 




Leu Arg 


Glu 


Leu 




370 






Leu 


Ala 


Cys 


Gin 


385 








Lys 


Asp 


Gly Ala 


Thr 


Leu 


Lys 


Asn 








420 


Asp 


Leu 


Gly 


Val 






435 




Thr 


Thr 


Thr 


Gly 




450 






Cys 


Pro 


Asp 


He 


465 








Lys 


Pro 


Val 


Glu 


Leu 


Glu 


Gly 


Gly 








500 


Cys 


Tyr 


Leu 


Thr 






515 




Thr 


Ala 


Cys 


Val 




530 






Glu 


Gin 


Val 


Leu 


545 








Ser 


Gin 


Gly Arg 


Phe 


Gin 


Thr 


Gin 








580 


Trp 


Glu 


Lys 


Ala 






595 




His 


Pro 


Lys 


Asp 




610 






Gly Leu 


Arg 


His 


625 








Pro Arg 


His 


Leu 


Leu 


Pro 


Cys 


Leu 








660 


Asp 


Tyr 


Leu 


Trp 






67 5 




His 


lie 


Leu 


His 




690 







Glu 


Leu 


Gly 


Pro 


Val 


Asp 


Ala 


Leu 








280 


Ser 


Ser 


Pro 


Ser 






295 




Leu 


Asp 


Glu 


Pro 




310 






Val 


Glu 


His 


He 


325 








Lys 


Glu 


Glv 


Pro 


Ser 


Gly 


Leu 


Lys 








360 


Ser 


Asp 


Glu 


Thr 






375 




Val 


Ser 


Ala 


Gin 




390 






Pro 


Leu 


Glu 


Ser 


405 








Phe 


Gin 


Leu 


Leu 


Tyr 


Thr 


Cys 


Ser 








440 


Val 


Leu 


Arg 


Lys 






455 




Gly 


Glu 


Val 


Tyr 




470 






Ser 


Tyr 


Gly 


Pro 


485 












Thr 


Thr 


Ser 


Lys 


Leu 


Ser 








520 


Ser 


Lys 


Ala 


Gly 






535 




Leu 


Gly 


Gly 


Pro 




550 






Ser 


Ala 


Gin 


Pro 


565 








He 


Gin 


Arg 


Glv 


Ser 


Gly 


Arg 


Ala 








600 


Lys 


Thr 


Ala 


Val 






615 




Pro 


His 


Leu 


Ala 




630 






Val 


Leu 


He 


Leu 


645 








Ala 


Glu 


Arg 


Ala 


Gin 


Met 


Leu 


Ser 








680 


Leu 


Asp 


Leu 


Arg 






695 





250 

His Ala Gly Leu 
265 

Leu Ala Glu Ala 

Arg Ser Leu Phe 
300 

Ala Glu Leu Gly 
315 

Ser Arg He Leu 
330 

Pro Arg Lys Lys 
345 

Ser Trp Asp Arg 

Val Val Leu Gly 
380 

Pro Ala Ala Gin 
395 

Ser Ser Arg Val 
410 

Thr He Leu Val 
425 

Val Ser Asn Ala 

Ala Glu Arg Pro 
460 

Ala Asp Gly Val 
475 

Val Thr Tyr He 
490 

Leu Ala Ser Asp 
505 

Arg Gly Gly Thr 

Met Gly Pro Tyr 
540 

Ser His Leu Ala 
555 

Leu Pro Ser Thr 
570 

Arg Phe Ser Val 
585 

Leu Ala Ala Lys 

Leu Arg Glu Tyr 
620 

Gin Leu His Ala 
635 

Glu Leu Cys Ser 
650 

Ser Tyr Ser Glu 
665 

Ala Thr Gin Tyr 

Ser Glu Asn Met 
700 



255 

Glu He Thr Glu 
270 



Ala Val Gly Arg 
285 



His 


Phe 


Pro 


Gly 


Leu 


Arg 


Glu 


Arg 








320 


Lys 


Gly 


Arg 


Pro 






335 




Pro 


Gly 


Leu 


Ala 




350 






Ala 


Pro 


Thr 


Phe 


365 








Gin 


Ser 


Val 


Thr 


Ala 


Thr 


Trp 


Ser 








400 


Leu 


He 


Ser 


Ala 






415 




Val 


Val 


Ala 


Glu 




430 






Leu 


Gly 


Thr 


Val 


445 








Ser 


Ser 


Ser 


Pro 


Leu 


Leu 


Val 


Trp 








480 


Val 


Gin 


Cys 


Ser 






4 95 




He 


Phe 


Asp 


Cvs 




510 






Tyr 


Thr 


Phe 


Arg 


525 








Ser 


Ser 


Pro 


Ser 


Ser 


Glu 


Glu 


Glu 








560 


Lys 


Thr 


Phe 


Ala 






575 




Val 


Arg 


Gin 


Cys 




590 






He 


He 


Pro 


Tyr 


605 








Glu 


Ala 


Leu 


Lys 


Ala 


Tyr 


Leu 


Ser 








640 


Gly 


Pro 


Glu 


Leu 






655 




Ser 


Glu 


Val 


Lys 




670 






Leu 


His 


Asn 


Gin 


685 








He 


He 


Thr 


Glu 
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Tyr 


Asn 


Leu 


Leu 


Lys Val Val Asp Leu Gly 


Asn 


Ala 


Gin 


Ser Leu 


Ser 


705 








710 


715 








720 


Gin 


Glu 


Lys 


Val 


Leu Pro Ser Asp Lys Phe 


Lys 


Asp 


Tyr 


Leu Glu 


Thr 








725 730 








735 




Met 


Ala 


Pro 


Glu Leu Leu Glu Gly Gin Gly Ala 


Val 


Pro 


Gin Thr 


Asp 








740 


745 








750 




lie 


Trp 


Ala 


He 


Gly Val thr Ala Phe He 


Met 


Leu 


Ser 


Ala Glu 


Tyr 




755 




760 






7 65 






Pro 


Val 


Ser 


Ser 


Glu Gly Ala Arg Asp Leu 


Gin 


Arg 


Gly 


Leu Arg 


Lys 




770 






775 




780 








Gly 


Leu 


Val 


Arg 


Leu Ser Arg Cys Tyr Ala 


Gly 


Leu 


Ser 


Gly Gly Ala 


785 








790 


795 








800 


Val 


Ala 


Phe 


Leu 


Arg Ser Thr Leu Cys Ala 


Gin 


Pro 


Trp 


Gly Arg 


Pro 










805 810 








815 




Cys 


Ala 


Ser 


Ser 


Cys Leu Gin Cys Pro Trp 


Leu 


Thr 


Glu 


Glu Gly 


Pro 






820 


825 








830 




Ala 


Cys 


Ser 


Arg 


Pro Ala Pro Val Thr Phe 


Pro 


Thr 


Ala 


Arg Leu 


Arg 




835 


840 






845 






Val 


Phe 


Val 


Arg Asn Arg Glu Lys Arg Arg 


Ala 


Leu 


Leu 


Tyr Lys Arg 




850 






855 




860 








His 


Asn 


Leu 


Ala 


Gin Val Arg 












865 








870 













<210> 8 

<211> 548 

<212> PRT 

~<213> Rattus norvegicus 



<400> 8 



Met 


Ala 


His 


He 


Ser 


Arg He 


Leu 


Lys 


Gly Lys 


Pro 


Glu 


Gly 


Pro 


Glu 


1 








5 








10 










15 




Lys 


Glu 


Gly 


Pro 


Pro 


Arg Lys 


Lys 


Ala 


Gly 


Leu 


Ala 


Ser 


Phe Arg 


Leu 






20 








25 










30 






Ser 


Gly 


Leu 
35 


Lys 


Gly 


Arg Asp 


Gin 
40 


Ala 


Pro 


Ser 


Phe 


Leu 
45 


Arg 


Glu 


Leu 


Ser 


Asp 


Glu 


Ala 


Val 


Val Leu Gly Gin 


Ser 


Val 


Thr 


Leu 


Ala 


Cys 


Gin 




50 








55 










60 










Val 


Leu 


Ala 


Gin 


Pro 


Thr Ala 


Gin 


Ala 


Thr 


Trp 


Ser 


Lys 


Asp Gly Ala 


65 










70 








75 










80 


Leu 


Leu 


Glu 


Ser 


Ser 


Gly His 


Leu 


Leu 


He 


Ser 


Ser 


Thr 


Leu 


Lys 


Asn 










85 






90 










95 




Phe 


Gin 


Leu 


Leu 
100 


Thr 


He Leu 


Val 


Val 
105 


Thr 


Glu 


Glu 


Asp 


Leu 
110 


Gly 


Thr 


Tyr 


Thr 


Cys 


Cys 


Val 


Ser Asn 


Pro 


Leu Gly Thr Ala 


Val 


Thr 


Thr 


Gly 




115 






120 










125 








Val 


Leu 


Arg 


Lys 


Ala 


Glu Arg 


Pro 


Ser 


Ser 


Ser 


Pro 


Arg 


Pro 


Glu 


Val 




130 






135 










140 










Gly 


Glu 


Leu 


Tyr 


Thr 


Asp Ala 


Val 


Leu 


Leu 


Val 


Trp 


Lys 


Pro 


Val 


Glu 


145 








150 








155 










160 


Ser 


Tyr 


Gly 


Pro 


Val 


Thr Tyr 


He 


Val 


Gin 


Cys 


Cys 


He 


Glu 


Gly 


Gly 






165 








170 










175 




Ser 


Trp 


Thr 


Thr 


Leu 


Ala Ser 


Asp 


He 


Ser 


Asp Cys 


Cys 


Tyr 


Leu 


Thr 






180 








185 










190 






Gly 


Lys 


Leu 


Pro 


Arg 


Gly Gly 


Met 


Tyr 


Thr 


Phe 


Arg 


Thr 


Ala 


Cys 


Val 


195 








200 










205 








Ser 


Lys 
210 


Ala 


Gly 


Met 


Gly Pro 
215 


Tyr 


Ser 


Ser 


Pro 


Ser 
220 


Glu 


Gin 


Val 


Leu 
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Leu Gly Gly 

225. 

Arg Pro Ala 

Gin lie Arg 

Ala Ser Gly 
275 

Asp Lys Thr 

290 
His Pro His 
305 

Leu Val Leu 

Leu Ala Glu 

Trp Gin Met 
355 

His Leu Asp 

370 
Leu Lys Val 
385 

Val Pro Pro 

Glu Leu Leu 

lie Gly Val 
435 

Ser Glu Gly 

4 50 
Gin Leu Ser 
465 

Leu Gin Ser 

Thr Cys Leu 

Arg Pro Thr 
515 

Arg Glu Arg 

530 
Ala Gin Val 
545 



Pro Asn 

Gin Leu 
245 
Arg Gly 
260 

Arg Ala 

Thr Val 

Leu Ala 

lie Leu 
325 
Arg Asp 
340 

Leu Ser 

Leu Arg 

lie Asp 

Pro Glu 
405 
Glu Gly 
420 

Thr Ala 

Thr Arg 

Arg Cys 

Ser Leu 
485 
Gin Cys 
500 

Pro Val 
Glu Lys 
Arg 



His Leu Ala 
230 

Leu Pro Ser 

Arg Phe Ser 

Leu Ala Ala 
280 

Leu Arg Glu 

295 
Gin Leu His 
310 

Glu Leu Cys 

Ser Tyr Ser 

Ala Thr Gin 
360 

Ser Glu Asn 

375 
Leu Gly Asn 
390 

Asn Phe Lys 

Gin Gly Ala 

Phe He Met 
440 

Asp Leu Gin 

455 
Tyr Ala Gly 
470 

Cys Ala Arg 

Gly Trp Leu 

Thr Phe Pro 
520 

Arg Arg Ala 
535 



Ser Glu 

Thr Lys 
250 
Val Val 
265 

Lys He 

Tyr Glu 

Ala Ala 

Ser Gly 
330 
Glu Ser 
345 

Tyr Leu 

Met Met 

Ala Gin 

Asp Tyr 
410 
Val Pro 
425 

Leu Ser 

Lys Gly 

Leu Ser 

Pro Trp 
490 
Thr Glu 
505 

Thr Ala 
Leu Leu 



Glu Glu 
235 

Thr Phe 

Arg Gin 

Val Pro 

Ala Leu 
300 
Tyr Leu 
315 

Pro Glu 

Asp Val 

His Ala 

Val Thr 
380 
Ser Leu 
395 

Leu Glu 

Gin Thr 

Gly Glu 

Leu Arg 
460 
Gly Gly 
475 

Gly Arg 

Glu Gly 

Arg Leu 

Tyr Lys 
540 



Ser Ser 

Ala Phe 

Cys Arg 
270 
Tyr Gin 
285 

Lys Arg 

Ser Pro 

Leu Leu 

Lys Asp 
350 
Gin His 
365 

Glu Tyr 

Ser Gin 

Thr Met 

Asp He 
430 
Tyr Pro 
445 

Lys Gly 

Ala Val 

Pro Cys 

Pro Thr 
510 
Arg Ala 
525 

Lys His 



Arg Gly 
240 
Gin Thr 
255 

Glu Lys 

Pro Glu 

Leu His 

Arg His 
320 
Pro Ser 
335 

Tyr Leu 

He Leu 

Asn Leu 

Glu Lys 
400 
Ala Pro 
415 

Trp Ala 

Val Ser 

Leu He 

Ala Phe 
480 
Ala Ser 
495 

Gly Ser 
Phe Val 
Asn Leu 



<210> 9 
<211> 548 
<212> PRT 

<213> Mus musculus 
<400> 9 

Met Ala His He Ser Arg lie Leu Lys Gly Arg Pro Glu Gly Pro Glu 

15 10 15 

Arg Glu Gly Pro Pro Arg Lys Lys. Ala Gly Leu Ala Ser Phe Arg Leu 
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20 25 30 

Ser Gly Leu Lys Gly Arg Asp Gin Ala Pro Ser Phe Leu Arg Glu Leu 

35 40 45 

Ser Asp Glu Ala Val Val Leu Gly Gin Ser Val Thr Leu Ala Cys Gin 

50 55 60 

Val Leu Ala Gin Pro Thr Ala Gin Ala Thr Trp Ser Lys Asp Gly Val 
65 70 75 80 

Leu Leu Glu Ser Ser Gly His Leu Leu He Ser Ser Thr Leu Lys Asn 

85 90 95 

Phe Gin Leu Leu Thr He Leu Val Val Lys Glu Glu Asp Leu Gly Thr 

100 105 HO 

Tyr Thr Cys Cys Val Ser Asn Pro Leu Gly Thr Ala Val Thr Thr Gly 

115 120 125 

Val Leu Arg Lys Ala Glu Arg Pro Ser Ser Ser Pro Arg Pro Glu Val 

130 ' ' 135 140 

Gly Glu Leu Tyr Lys Asp Ala Val Leu Leu Val Trp Lys Pro Val Glu 
145 " 150 155 160 

Ser Cys Gly Pro Val Thr Tyr He Val Gin Cys Cys He Glu Gly Gly 

165 170 _ 175 

Ser Trp Thr Thr Leu Ala Ser Asp lie Ser Asp Cys Cys Tyr Leu Thr 

180 185 190 

Gly Lys Leu Ser Arg Gly Gly Met Tyr He Phe Arg Thr Ala Cys Val 

195 200 205 

Ser Lys Ala Gly Met Gly Pro Tyr Ser Ser Pro Ser Glu Gin Val Leu 

210 215 220 

Leu Gly Gly Pro Asn His Leu Ala Ser Glu Glu Glu Ser Ser Arg Gly 
225 230 235 240 

Arg Pro Ala Gin Leu Leu Pro Ser Thr Lys Thr Phe Ala Phe Gin Met 

245 250 255 

Gin lie Arg Arg Gly Arg Phe Ser. Val Val Arg Gin Cys Arg Glu Lys 

260 265 270 

Ala Ser Gly Arg Ala Leu Ala Ala Lys He Val Pro Tyr Gin Pro Glu 

275 280 285 

Asp Lys Thr Ala Val Leu Arg Glu Tyr Glu Ala Leu Lys Arg Leu His 

290 295 300 

His Pro His Leu Ala Gin Leu His Ala Ala Tyr Leu Ser Pro Arg His 
305 310 315- 320 

Leu Val Leu He Leu Glu Leu Cys Ser Gly Pro Glu Leu Leu Pro Ser 

325 330 335 

Leu Ala Glu Arg Glu Ser Tyr Ser Glu Ser Asp Val Lys Asp Tyr Leu 

340 345 . 350 

Trp Gin Met Leu Ser Ala Thr Gin Tyr Leu His Ala Gin His lie Leu 

355 360 365 

His Leu Asp Leu Arg Ser Glu Asn Met Met Val Thr Glu Tyr Asn Leu 

370 375 380 

Leu Lys Val He Asp Leu Gly Asn Ala Gin Ser Leu Asp Gin Glu Lys 
385 390 395 400 

Val Pro Ala Pro Glu Asn Phe Lys Asp Tyr Leu Glu Thr Met Ala Pro 

405 410 415 

Glu Leu Leu Glu Gly Gin Gly Ala Val Pro Gin Thr Asp He Trp Ala 

420 425 430 

He Gly Val Thr Ala Phe lie Met Leu Ser Gly Glu Tyr Pro Glu Ser 

435 440 445 

Ser Glu Gly Thr Arg Asp Leu Gin Lys Gly Leu Arg Lys Gly Leu lie 

450 455 460 

Arg Leu Ser Arg Cys Tyr Ala Gly Leu Ser Gly Gly Ala Val Ala Phe 
465 470 475 480 
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Leu Gin Ser Ser Leu Cys Ala Gin 
485 

Thr Cys Leu Gin Cys Gly Trp Leu 
500 

Arg Pro Thr Pro Val Thr Phe Pro 
515 520 
Arg Glu Arg Glu Lys Arg Arg Ala 

530 " 535 
Ala Gin Val Arg 
545 



Pro Trp Gly Arg Pro Cys Ala Ser 

490 495 
Thr Glu Glu Gly Pro Thr Gly Ser 
505 510 
Thr Val Arg Leu Arg Ala Phe Val 
525 

Leu Leu Tyr Lys Lys His Asn Leu 
54 0 
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MA Majority 

ID 20 30 40 
-I 1 , , I 



M|G PGDISLPGRPKPGPCSSPGSASQASSSQVSSLRVGSSQ H19G5-Fl.pep 

Ma| R19G5-S.pep 

M Al ML9G5-S.pep 

Majority 

50 60 70 80 
i — 1 1 u 

41 VGTEPGPSLDAEGWTQEAEDLSDSTPTLQRPQEQATMRKF H19G5-Fl.pep 

3 R19G5-S.pep 

3 M19G5-S.pep 

Majority 

i i i i 

90 100 110 120 

I I 1 L. 

8L SLGGRGGYAGVAGYGTFAFGGDAGGMLGQGPMWARIAWAV H19G5-Fl.pep 

3 KL9G5-S.pep 

3 ML9G5-S.pep 



- - - — - - - ~ Majority 

130 140 150 160 
I 1 1 u 

121 SQSEEEEQEEARAESQSEEQQEARAESPLPQVSARPVPEV H19G5-Fl.pep 

3 R19G5-S.pep 

3 . M19G5-S.pep 



Majority 

1 1 1 r 

170 180 ISO 200 
i 1 1 1- 

161 GRAPTRSSPEPTPWEDIGOVSLVQIRDLSGDAEAADTISL H19G5-Fl.pep 

3 R19G5-S.pep 

3 ML9G5-S.pep 



Majority 



210 220 230 240 
1 1 1 l. 

201 DISEVDPAYLNLSDLYDIKYLPFEFMIFRKVPKSAQPEPP H19G5-Fl.pep 

3 R19G5-S.pep 

3 M19G5-S.pep 



Majority 



r — — — 1 1 

250 260 270 280 
I 1 I L- 

241 SPMAEEELAEFPEPTWPWPGELGPHAGLEITEESEDVDAL HL9G5-Fl.p^ 

3 R19G5-S.pep 

3 M19G5-S.pep 



- — — Majority 



290 300 310 320 
I I I L- 

281 LAEAAVGRKRKWSSPSRSLFHFPGRHLPLDEPAELGLRER HL9G5-Fl.p^> 

3 R19G5-S.pep 

3 M19G5-S.pg? 
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Monday, April 3, 2000 5:1 1 PM 



H 



ISRILKGRPEGPEKEGPPRKKAGLASFRLSGLK Majority 



330 



321 

3 

3 



V K A S V E 



340 



T 

350 
_1_ 



360 



HISRILK GR P E G[ljE KEGPPRK K^jG LASFRLSGL K 
HISRILK gH P E G P EKE GPPRKKAGLASFRLSGLK 
HISRUKGRPEGP eI r Ie GPPRKKACI.A SFRLSGLK 



H19G5-Fl.pep 

R19G5-S.pep 

M19G5-S.pep 



GRDQAPSFLRELSDEAVVLGQSVTLACOVLAQPTAQATWS Majority 

370 380 390 400 

361 SWJeLr)A P[T)F LRELSD e[tJV VLGQSVTLACQ Vj_SjA Q Pj_AjA QATWS 

37 |g r d oapsflrelsdeavvlgosvtlacqvlaqptaqatws 
37 Igrpoapsflrelsdeavvlgosvt lacovlaoptaoatws 



401 

77 

77 



441 
117 
117 



H19G5-Fl.pep 
R19G5-S.pep 
M19G5-S.pep 



KDGALLESSGHLLISSTLKNFQLLTILVVXEEDLGTYTCC Majority 

410 420 430 

— L 



440 



K D G Al PiL E S S i S R VI L I SUIT LKNFOLLTI LVVjV AjE n T. r,[v|V T KL9G5-Fl.pep 



LLESSGHLLISSTLKNFQLLTILV V]t|E EDLGTYTCC 

llessghlli'sstlk nfolltilvvi k Ieedlgtytcc 



R19G5-S.pep 
M19G5-S.pep 



VSNPLGTAVTTGVLRKAERPSSSPRPEVGELYXDAVLLVW Majority 



470 



450 460 

V S ^ A|L G T jV T| T TGVLRKAERPSSS P[C>P|D _I)G E[v]Y 
VSNPLGTAVTTGVLRKAERPSSSPRPEVGELY 
VSNPLGTAVTTGVLRKAERPSSSPRP EVGELY 



480 

L 



d |gJ v llvw 

DAVLLVW 



D A V L L VWI M19G5-S.pep 



HL9G5-Fl.pep 
R19G5-S.pep 
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KPVESYGPVTYIVQCCIEGGSWTTLASDI 
i .i 

500 510 



481 
157 
157 



490 



SDCCYLTGXLS 
t r 

520 



Majority 



1 » i ■ 

KPVESYGPVTYIVQ C | S L| E GGSWTTLASD l[FjD C C Y L t[sJk L sj H19G5-F1. 



K P V E SYG PVTYIVQCCIEGGSWTTLASD 

del 



K P V E 



1G PVTYIVQCCIEGGSWTTLASD 



I SDCCYLTGK L|_P 
I SDCCYLTGK LSI 



pep 
R19G5-S.pep 
M19G5-S.pep 



521 
197 
197 



561 
237 
237 



RGGMYTFRTACVSKAG'MGPYSSPSEQVLLGGPNHLASEEE Majority 

540 



530 



550 



560 



R G Gj__Tj Y TFRTACVSKAGMGPYSSPSEQVLLGG Pj_S|H LASEEE 
R G G M YTF RTACVSKAGMGPYSSPSEQVLLGGPNHLASEEE 

mdik 



RG G 



R TACVSKAGMGPYSSPSEOVLLGGPNHLASEEE 



H19G5-Fl.pep 

R19G5-S.pep 

ML9G5-S.pep 



SSRGRPAQLLPSTKTFAFQTQIRRGRFSVVRQCREKASGR Majority 



570 



580 



T 

590 

4- 



600 
i 



sL_Q)g R[_s|a q[pJl pstktfafqtq iLqJr GRFSVVRQ c[wje kasgr 
ssrgrpaqllpstktfaf qtq irrgrf svvrqcrekasg r 
ssrgrpaqllpstktfaf oI m Io irrgrfsvvroc re kasgr 



H19G5-Fl.pep 

R19G5-S.pep 

M19G5-S.pep 



600 
277 
277 



ALAAKIVPYQPEDKTAVL 
610 



REYEALKRLHHPHLAQLHAAYL Majority 
620 630 640 



i t > i 

A l a A k i[ijp y|_hJpLkjd ktavlreyeal k[g|l[rJh PHLAQLHAAYL 

ALAAKIVPYQ PEDK T@V LREYEALKRLHHPHLAQLHAAYL 
AXAAKIVPYOPEDKTAVLREYEAL KRLHHPHLAOLHAAVT, 



H19G5-Fl.pep 

R19G5-S.pep 

ML9G5-S.pqp 
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SPRHLVLILEL'CSGPELLPSLAERXSYSESDVKDYLWQML Majority 



640 
317 
317 



680 
357 
357 



650 



660 



670 



SPRHLVLILELCSGPELL P[£J L A E R A 
SPRHLVLILELCSGPELLPSLAERD 

sprhlvm lelcsgpellpslaerIe 



L_ U 

S Y S E s[eJv KDYLWQML 
SYSESDVKDYLWQML 

gysESD.V KPY^wpMt* 



H19G5-Fl.pep 

R19G5-S.pep 

M19G5-S.pep 



SATOYL'HAQHILHLDLRSENMMVTEYNLLKVIDLGNAQSL Majority 
690 



700 

_L_ 



-r 

710 



720 
1 



satqyl h[n)q hilhldlrsen mLi_je)t eynllk v[vJd lgnaqsl 
satqylhaqhilhldlrsenmmvteynllkvidlgnaqsl 

SATQYLHAQHILHLDLRSENMMVT EYNLLKVIDLGNAOS 



KL9G5-Fl.pep 

R19G5-S.pep 

M19G5-S.pep 
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SQEKVPPPENFKDYLETMAPELLEGQGAVPQTDIWAIGVT Majority 

750 



720. 
397 
397 



760 
437 
437 



800 
477 
477 



840 

517 
517 



730 



740 



760 
1 



o e k vIlIp Is DKl FKDYLETMAPELLEGQGAVPQTDIMAIGVT 
QEKVPPPENFKDYLETMAPELLEGQGAVPQTDIWAIGVT 
oekv pT a Ip ENFKDYLETMAPELLEGOGAVPOTDIWAIGVT 



H19G5-Fl.pep 

R19G5-S.pep 

M19G5-S.pep 



A F 



IMLSGEYP VSSEGTRDLQKGLRKGLIRLSRCYAGLSGG Majority 
, 1 — 1 ' T" 

770 780 790 . 800 

X. 



770 780 790 . 80( 

a f i m l s[a)e ypvsse g[aJr d l q[r)g l r k g l[v|r L SRCYAGLSGG 



A F I ML S G EYPVS SEGTRDLQKGLRKG L^LfijL SRCYAGLSGG 
AFIMLSGEY Pf l ls SEGTRPLQKGUKGURL$yCY AgL SGQ 



AVAFLQSSL CAQPWGRPCASTCLQCGWLTEEGPTGS 
* , * — | i 



810 

A V A F L(_rJs|_T)L C A 
AVAFLQS S LCA 



820 



R P T P 
r 

830 840 

P W G R P C A S|S1C L 0 ClPlw L T E E G P | A Cl S R PlA|P 
PWGRPCASTCLQCGWLTEEGPTGSRPTP 
AVAFLOSSLCAOPWGRPCASTCLOCGWLTEEGPTGSRPTPI 



H19G5-Fl.pep 

R19G5-S.pep 

M19G5-S.pep 

Majority 



KL9G5-Fl.pep 

R19G5-S.pep 

M19G5-S.pep 



VTFPTARLRAFVREREKRRALLYKKHNLA 

_. _ 1 



T 

850 



860 



870 



VTFPTARL RlVjF V R|_NjR EKRRALLY K|_R]H N L A Q V R 
V T F P TAR LRAFVREREKRRALLYKKHNLAQVR 
v t f p tJvIr LRAFVREREKRRALLYKKHNLAOVR 



Majority 



H19G5-Fl.pep 

R19G5-S.pep 

M19G5-S.pep 



Decoration 'Decoration #1' : Box residues that match the Consensus exactly. 
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f\G. 3 
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fvG. H 
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Sequence Comparison of 19G5 , Trio, and MLCK Kinase Domains 



19G5C-term KD 
19G5N-term KD 
Trio KD 
SM MLCK KD 

19G5 C-term KD 
19G5 N-term KD 
Trio KD 
SM MLCK KD 

19G5 C-termKD 
19G5 N-term KD 
Trio KD 
SM MLCK KD 

19G5 C-termKD 
19G5 N-term KD 
Trio KD 
SM MLCK KD 

19G5 C-termKD 
19G5 N-term KD 
Trio KD 
SM MLCK KD 

19G5 C-termKD 
19G5 N-term KD 
Trio KD 
SM MLCK KD 

19G5 C-termKD 
19G5 N-term KD 

Trio KD 

SM MLCK KD 

19G5 C-term KD 
19G5 N-term KD 
Trio KD 
SM MLCK KD 

19G5 C-term KD 
19G5 N-term KD 
Trio KD 
SM MLCK KD 
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Tissue distribution of Rat H19G5 kinase 
Rat MTN Blot Probed with R19G5 Ral MTN Blot Probed with beta-Actin 
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